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STORAGE  PROBLEMS  WHEN  DEMAND  IS  "ALL  OR  NOTHING”* 


I)  P.  (Saver  and  P  A  Jacobs 

Department  <»/  Operations  Research 
Sava  I  Postgraduate  School 
Monterey.  (  ah  forma 


K  IIS  1R  \(  I 


An  mvcnlorv  ol  plnsical  goods  01  sioragc  sp.uv  (in  .1  iommuiiu..»iiuns  s\s- 
torn  butler.  lor  instance)  often  experiences  "all  01  noiliing"  deniaiul  il  .1 
ilernaml  of  random  si/e  /)  can  he  niimediat  *l\  and  ennrelx  lilled  from  slock  it 
is  satisfied,  bin  otherwise  11  vanishes  Probabilistic  pioperties  ol  the  icsultmg 
mxenlory  level  are  discussed  analv  ticallv .  both  I’oi  the  single  hullei  and  I < m 
multiple  hullei  piohlents  Numenc.il  icsulis  aie  piesented 


1.  INTRODUCTION 

The  usual  storage  or  inventory  problems  involve  demands  imagined  it'  occur  randomly, 
and  to  be  capable  of  reducing  any  available  stock  to  zero,  or  even  beyond,  when  backordering  is 
permitted.  Yet  in  many  situations  at  least  one  component  of  total  demand  is  "all  or  nothing." 
that  is,  it  reduces  inventory  only  if  it  can  be  entirely  satisfied  by  the  inventory  present,  and  oth¬ 
erwise  seeks  another  supplier.  Here  are  examples 

(a)  A  manufacturer's  warehouse  is  filled  with  a  certain  item  at  the  beginning  of  the  sel¬ 
ling  season;  let  /  denote  the  initial  inventory  Suppose  that  demands  occur  as  follows:  a  mes¬ 
sage  is  sent  requesting  that  /),  items  be  shipped  from  inventory,  but  only  if  the  entire  order  can 
be  filled.  That  is,  the  demand  is  satisfied  if  />,  ^  /.  in  which  case  inventory  level  is  reduced  to 
10)  =  /  I),.  while  if  /> ,  >  /  the  inventory  remains  unchanged  and  /(I)  -  /  Allowing  for 
no  replenishment,  the  second  demand,  of  si/e  /)>.  interacts  with  inventory  /(I),  so  that  it  is 
filled  if  />,  ^  /(II.  but  is  not  placed  if  /)  >  >  / ( 1 )  The  process  continues  along  these  lines 
until  the  selling  season  is  over  and  there  tire  no  more  demands 

(b)  A  butler  storage  device  used  to  contain  messages  prior  to  their  batch  transmission 

has  capacity  /  Messages  of  length  (/>,  i  =  1.2.  .1  approach  the  buffer  successively,  and  are 

admitted  on  an  "all  or  nothing"  basis,  just  as  was  true  of  demands  for  physical  inventory  in  (a) 
above  Once  again  rejection  will  occur,  and  more  frequently  to  large  demands  (messages)  than 
to  short  ones 

(c >  A  system  of  many  buffer  storage  devices  is  used  to  contain  messages  prior  to  their 
batch  transmission  l  ach  butler  has  capacity  I  Messages  of  length  |/)  .  i  1.2,  |  approach 
the  device  and  are  successively  admitted  to  the  first  buffer  until  there  is  a  demand  that  exceeds 
its  remaining  capacity  The  first  buffer  is  left  forever  and  the  demand  that  exceeds  the  first 

I  Ills  I  Csc.ll  y  !l  W  .|s  slippof  I  i’ll  h\  Hu*  \. uimi.ll  Su.-lUf  I  Mil  ul.llli"!  lliuk'l  \S|  I  Nl  I  l  I’ll  I  't  I  \  *M  1  '!  1 1 )  J  |  u,|  \K  s 
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buffer,  plus  successive  demands,  applies  to  the  second  buffer  until  one  occurs  that  exceeds  the 
remaining  capacity.  This  demand  then  applies  to  the  third  buffer,  and  so  on.  As  a  result  there 
will  be  some  unused  capacity  in  each  buffer.  For  a  similar  problem  see  the  paper  ol'  Coffman. 
Hofri,  and  So  (21 .  For  related,  although  not  identical  formulations,  see  Cohen  [31,  Gavish  and 
Schweitzer  [6],  and  Hokstad  17] . 

In  Section  2  we  will  discuss  some  models  for  the  situations  in  Examples  (a)  and  (b).  We 
compute  such  items  as  the  distribution  of  the  amount  of  inventory  left  at  some  time  t  and  the 
distribution  of  the  times  of  successive  unsatisfied  demands. 

In  Section  3  we  next  consider  a  model  for  Example  (c),  and  derive  equations  for  the  lim¬ 
iting  distribution  of  used  capacity  of  a  buffer  and  the  expected  used  capacity  of  a  buffer  It 
seems  to  be  difficult  to  obtain  simple  analytic  solutions  to  these  equations,  but  certain  illustra¬ 
tive  numerical  results  are  provided. 

2.  THE  ONE-BUFFER  INVENTORY  PROBLEM 

Suppose  that  demands  for  available  stock  occur  according  to  a  compound  Poisson  process: 
if  N,  is  the  number  of  demands  that  occur  in  (0,  /],  then  \N,:i  >  0)  is  a  stationary  Poisson 
process  with  rate  X;  the  sizes  of  successive  demands  { Z),  1  are  independent  with  common  distri¬ 
bution  F.  Assume  that  there  are  no  replenishments  of  inventory.  Let  [I,,  t  >  0)  denote  the 
stochastic  process  describing  available  inventory  at  time  r.  and  let  {/(«);  //  =  0. 1,  . . . )  be  the 
stochastic  process  of  available  inventory  following  the  /ith  demand.  It  is  apparent  from  our 
assumptions  that  both  { /, }  and  (/(«))  are  Markov  processes. 

2.1  Functional  Equations  for  the  Amount  of  Available  Inventory 

Let 

(2.1)  <M.v.r>  =  E\c  ',rl 

be  the  Laplace  transform  of  the  available  inventory  at  lime  i.  Similarly,  let 

=  E[  '''"’l. 

Properties  of  the  available  inventory  can  be  studied  in  terms  of  <t>  and  ib.  It  may  be  shown  by 
using  conditional  expectations  that  satisfies  the  following  differential  equation. 

(2.2)  ~  =  \F  [c  '''  f'  (<’"-  I)  F ( d.x ) I . 

dt  1  J"  ‘ 

Further,  i h  satisfies  the  following  difference  equation 

(2.3)  i Ms.  n  +  I)  =  iM.v./i)  +  F  [<■  '/UI  f""'  (<■"  -  1 )  /  ( r/v ) ] . 

Differentiation  with  respect  to  v  at  s  =  0,  or  a  direct  conditional  probability  argument,  now  pro¬ 
duce  equations  for  F ! /, I  and  F |/(«)1: 

(2.4)  4  /  (A1  =  -A/f  f  v  FUix)\ 

<h  1 J (|  • 

and 

F |/(/l  +  1)1  =  /:[/(«))  -  /.  [j/  \  /  ((A>] 

In  general,  no  explicit  solutions  for  the  expected  values  are  available,  but  a  simple  lower 
bound  results  from  rewriting  (2.4)  as  follows 
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(2.5) 


~A 


Fidx ) 


^  ~\El/,FU,)] 
>  ~\FU)  £'[/,]. 


from  which  one  sees  that 

(2.6)  £(/,]>/  exp  [-\F(l)t] 

and  similarly 

£(/(«)]  3s  /[I  -  £(/)]". 

so  the  expected  available  inventory  declines  by  ai  most  an  exponential  rate. 


2.2.  Explicit  Solution  When  the  Demand  Distribution  is  Uniform 


Although  Equation  (2.2)  seems  to  be  quite  intractable  for  most  demand  distributions,  it 
can  be  solved  completely  when  A' is  uniform: 

0  <  v  c. 
c  >  v 


Fix)  = 


(2.7) 


In  this  case  (2.2)  can  be  expressed  as 

-  A  F 

[<■  ''  f  (<■'*  -  1)  — 

di 

=  A  E 

i  - ,  <• J 1 1 

sc 


Y  ds  ' 


In  other  words,  <t>  satisfies  a  first-order  (quasi)  linear  partial  differential  equation  with  ini- 
'.  Standard  procedures  (Sneddon  [81 )  easily  yield  the  solution 

1  -  exp  l~(s  f  ( A / 1  )  r )  / 1 
.s  +  (A/ c )  l 


tial  condition  <Ms.  0)  =  c 
(2  8) 


which  gives  the  desired  transform.  Passage  to  the  limit  as  x  —  ()  in  (2.8)  shows  that 


(2.9) 


/  [/  i 


exp  |-^(A  t/<  )  /] 
(A/<  )/ 


This  formula  can  also  he  derived  by  first  finding  an  expression  for  the  Ath  moment  of  /,,  and 
then  employing  a  Taylor  series  argument 


In  order  to  invert  the  transform  in  (2.8)  note  that 

,  C 1  >.  »i  .  ^  i  1  <t>  (  s,i )  I  exp  I  -  ( v  +  (A  //<  )  )/l 

\  +  (A/c)f 

Thus,  by  the  unicity  theorem 

for  Laplace  transforms. 


Jn 


I  \  i  \  I  in 


which  is  the  transform  of  a  truncated  exponential  distribution 
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(2.1D 


/’(/  >  v|  = 


expl  (A  l/c) A  ] 
0 


0  A  <  /. 
/  ^  V. 


Note  thai  i he  distribution  of  /  is  absolutely  eoniinuous  in  (he  inierval  (0.  /)  but  that  there  is  a 
jump  at  /  corresponding  to  the  occurrence  of  no  demand  less  than,  or  equal  to.  /  in  (0.  (]: 

(2.12)  /’I/.  =  /}  =  exp  t— A/(//e)). 


2.3.  The  Fx peeled  Number  of  Satisfied  Demands 


Supposing  that  an  initial  inventory,  or  storage  capacity,  /  prevails,  it  is  of  interest  to  com¬ 
pute  the  probability  that  a  demand  is  satisfied,  and  the  expected  number  of  demands  satisfied  in 
an  interval  of  length  t  First  notice  that  if  a  demand  of  size  /)(/)  appears  at  time  r,  at  which 
moment  /  is  available,  then 


H\ /)(/)  <  /!/,)=  /(/,) 


is  the  conditional  probability  that  the  demand  is  satisfied.  When  /"is  uniform,  as  is  presently 
true,  we  may  remove  the  condition  to  find  that 


r\nu)  sg 


e 


1  -  exp  [-(Ar/< )/] 
A  i 


If  SU)  is  the  number  of  demands  satisfied  during  the  time  interval  (0,  r|.  then  since  demands 
arrive  according  to  a  Poisson  process  with  rate  A, 


(2.13) 


/.[.>( r ) I  =  a  r  /;|/(/„)l</,,-A  f,|-exP|-(A»/{-)/| 
Jn  J o  A u 


Ju 


—  y  +  In 


A  1 1 


+  Z  i 


A  1 1 


where  /.',( ■)  is  an  exponential  integral;  Abramowitz  and  Stegun  |1),  and  y- 0.5112. 
F.uler's  constant 


2.4  The  Time  of  the  First  Unsatisfied  Demand  and  the  Amount 
of  Unused  Inventory  at  that  Time 

As  before  /'is  the  common  distribution  function  of  the  successive  demands.  Now  let  t 
be  the  time  of  the  first  unsatisfied  demand.  Then 

r\r  >  r|.Y  =  n  |  =  /»(/),  I.  D:  S?  /  -  A), . £>„</-/),- - D„  ,| 

-  /""(/) 

where  /"’ " ’  denotes  the  /it h  convolution  of  /  with  itself.  Hence, 

(2.14)  /Mr  >  r)  -  T  <■  *'— ~  / 1 '"(/). 


Fxplictt  expressions  for  the  distribution  of  t  can  be  obtained  in  some  cases  If  /  is  uni¬ 
form  on  |0.  c  l  with  c  ^  /,  then 


2  15) 

2c 

-c 

V 

■ 

2 

Kll 

i  : 

1 

( 
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where  /tl(- )  is  a  modified  Bessel  function  of  the  first  kind  of  the  zeroth  order.  In  this  case. 
(2.16)  £[r]  =  y  exp{//c)  =  exp{//2f[Z>)}. 

A  A 


If  £is  exponential  with  mean  l/p,  then 

(2.17)  P[t  >  r)  =  £  e ^e-^iSaJ}L 

“  n 1  “  i- 1 

n-0  "  •  *-«  K  ■ 


and 

(2.18) 


£[r]  -  f  [1  +  pl]  =  f 
A  A 


i  + 


/ 

£U>] 


Note  that  if  /  is  small  relative  to  £[/)],  then  the  expected  lime  to  first  unsatisfied  demand 
when  £is  exponential  will  be  greater  than  the  expected  time  when  £is  uniform.  However,  for 
/large  relative  to  £[/)]  the  expected  time  for  Fexponenlial  will  be  less  than  the  expected  time 
when  £is  uniform. 

Let  Y„  be  the  amount  of  inventory  present  at  the  time  of  the  Mh  unsatisfied  demand. 
Then  for  0  ^  a  <  / 

(2.19)  £(K|  >  /  -  a)  =  f'  Ridv)  £(/  -  v) 

J  0 

where 

(2.20)  Riy)  -  £  F'"'(y) 

,,-t) 

and 

(2.21)  £(/-.v)  =  1  -£(/-  y>. 


Again,  explicit  expressions  for  the  distribution  of  L,  can  be  obtained  for  some  distribu¬ 
tions  £  If  £is  uniform  on  10.  c)  for  c  >  /,  then 

(2.22)  P{  Y,  2  I  -  a)  -  1  - 

If  £  is  a  truncated  exponential 


exp  | 

1 

—  a 

c 

(2.23) 


Fix) 


x  s:  /, 


A-  ^  /, 


then 


(2.24)  P I  K,  >  /  -  a)  -  I  -  [e  ""  -  c  "1  11  -  <■  "1  '  exp  lMe|l  -  c  "1  ')• 

If  £has  an  exponential  distribution  with  mean  \/p.  then 

(2.25)  P I  I,  ^  /  -  «)  =  c  "" 

In  this  last  case,  the  distribution  function  of  can  be  computed  by  induction  quite  easily  and 

(2.26)  £(  )„  >  /  -  a)  »  r 

‘ 
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Hence,  when  F  is  exponential 
(2.27)  E[Y„\ - —  [1  - 

In  principle,  similar  results  can  be  obtained  for  other  distributions,  but  we  have  found  no  sim¬ 
ple  expressions. 

2.5.  Inventory  Costs  and  Policies 

There  are  at  least  three  monetary  quantities  which  affect  the  profitability  of  an  inventory 
policy  over  a  fixed  interval  of  lime  (0,  /]:  the  selling  price,  p,  the  storage  cost,  a,  and  the  cost 
of  lost  demands,  b.  If  the  storage  cost  a  is  charged  just  on  the  basis  of  I  (something  like  ware¬ 
house  size)  then  the  total  expected  profit  in  (0,/J  is 

ZU)  =  p(l  -  £[/,])  -  al  -  bSU) 

—  (p  —  a)l  -  p  —  r)  (l  -  exp[ —  (Xr/c)/J) 

c  | 

£'MI 

for  the  case  of  uniformly  distributed  demands;  see  ( (2.9)  and  (2.13)).  One  can  numerically 
find  the  maximum  expected  profit  for  this  case;  nothing  explicit  seems  to  be  available. 

3.  THE  MANY-BUFFER  STORAGE  PROBLEM 


-  b 


y  +  In 


T7) 


In  this  section  we  will  study  a  model  for  the  situation  of  Example  (c)  in  Section  1.  Mes¬ 
sages  are  successively  admitted  to  the  mh  buffer  until  there  is  a  message  length  that  exceeds 
the  remaining  capacity  of  the  buffer.  The  total  amount  of  this  message  is  put  in  the  in  +  l)sr 
buffer  and  the  mh  buffer  is  left  forever.  Successive  messages  are  then  put  in  the  in  +  l)s/ 
buffer  until  there  is  a  message  whose  length  exceeds  the  remaining  capacity  of  the  in  +  l)sr 
buffer;  this  message  is  put  in  the  in  +  2)  nd  buffer  and  so  on. 

Let  /  denote  the  common  capacity  of  the  buffers  and  D,  denote  the  length  of  message  i. 
Assume  { £>, )  is  a  sequence  of  independent  identically  distributed  random  variables  with  distri- 

oo 

bution  F  having  a  density  function  /such  that  fix)  ^  d  >  0  for  x  €  [0,  /].  Let  R(x)  -  £ 

n- 0 

Fin)  (x)  be  the  renewal  function  associated  with  F.  If  FU)  <  1,  then  we  will  assume  that  an 
incoming  message  to  the  currently  used  mh  buffer  of  length  greater  than  /  is  sent  to  the 
in  +  Dsr  buffer,  when  it  cannot  fit  into  the  ( n  +  l)sr  buffer,  then  it  is  "banished,"  i.e.,  sent  to 
some  other  set  of  buffers.  The  next  message,  however,  will  try  to  enter  the  (n  +  l)sr  buffer. 
If  this  message  has  length  greater  than  /  it  is  banished  and  the  following  message  will  try  to 
enter  the  (n  +  l)sr  buffer;  all  messages  of  length  exceeding  /  will  be  banished  until  one  appears 
that  is  smaller  than  /  and  it  will  be  the  first  entry  in  buffer  (n  +  1). 

This  model  has  been  studied  for  demand  distributions  Fwith  FU)  —  1  by  Coffman  et  al 
(2).  Their  approach  was  to  study  the  Markov  process  describing  the  total  amount  of  inventory 
or  space  consumed  in  successive  buffers  or  bins.  Here  we  study  the  process  |L„),  where  L„  is 
the  size  of  the  demand  that  first  exceeds  the  remaining  capacity  of  the  mh  buffer; 
1,2,  .  .  .)  is  a  Markov  process.  Let 

K(x.  lO.vl)  -  P{L„^  ^  y\Ln  -  *|. 
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Note  that 


P(Z.,  <  y\  =  A< 0,  (0,  i|) 

is  the  same  as  the  sum  of  the  forward  and  backward  recurrence  times  at  time  /  for  a  temporal 
renewal  process  with  interrenewal  distribution  F.  see  Feller  15).  Thus  for  r  sj  / 

(3.1)  //,  (v)  =  P|Z.,  5=  .»•}  =  f'  Rid:)  |/(r)  -  FU  -  r)]. 

Note  that  for  y  <  I 


(3.2) 


A' (  \.  [0.  rl)  = 


f  A(<£)  |A(i  )-/(/  -  v  -  ;)] 

•J  f  \  t 

if  .v  <  /  -  .v  ; 

c /  ' 

R  ((/r) \l ■'(»•)—/••(/  -  a  -  r )] 

J II 

if  /  -  t  «S  a  <  F. 

f  «(</;) \F(v)-F(l  -  r)| 

^  /  » 

if  A  >  /. 


Hence. 


(3.3)  A(.v,  ./»  ) 


[A(/  -  a  )  -  RU  -  a  -  r ) I  P(</v)  if  v  <  /  -  ,r. 

A</  —  a  )  /••(</»•)  if  /  >  a  >  /  -  r. 

/f(r>  /  (</r )  -  f  «(</.-)  /  <  i-  -  ;)  +  «(</.)/(  i  ) 

if  a  —  /  ). 

[«(/)-  A(/  -  .1  )  I  /••(</»•)  if  A  >  /. 


Note  that  for  some  0  <  a  <  h  <  I.  there  exists  a  S  >  0  such  that  for  all  a 
A(a,</i  )  ^  fi  for  i  t  l«.  A I 

where  A;< a.  </»•)  =  J*  A  ( v.  </r)  Air.  </»•>.  Hence,  hypothesis  /> ’  on  page  197  of  Doob  [4 ]  is 
satisfied.  Thus,  if 

A  ’(a.  ,1  )  =  P| t  1 1/.,  =  a) 
for  all  Borel  subsets  I.  then 
(3  4)  lim  A  "<  V..I )  =  //(  I ) 


exists  and  further  the  convergence  is  geometric 
I  A  "<  x.  4  )  -  //(  4  )  I  ay  ’ 
lor  some  positive  constants  «  and  y,  y  <  I  for  till  4 


Now  let 

//.  (  x  )  =-  PI/...  t  10.  v  1  j  /.  „  =  Ol. 

Then  a  renewal  argument  can  be  used  to  show  that  for  \  £  I 
(3  5)  //...,  lx)  -  J  //  *  A  l  i/r  I  ]/  (x)  /  (/  i  )  | 

*  |l  IIJI)\  J  KlJi  )  I /  ( x  )  /  (/  i  )] 


536 


»  I’  Ci AVI  R  AND  P  A  JACOBS 


Taking  limits  as  n  -»  ii  is  seen  that  the  distribution  //(a)  satisfies  the  following  equation  for 

(3.6)  Hix)  =  {  '  H  *  R  ( dv )  [Fix)  -  FU  -  v)] 

+  [1  -  HU)  1  Jt'  R(cly)  [FU)  -  FU  -  y)]. 

Equations  (3.1)  and  (3.6)  can  be  simplified  for  certain  specific  distributions  F. 

3.1  Exponential  Demands 

For  the  exponential  distribution  with  mean  I  and  v  <  /  the  equations  are 

(3.7)  //, (  v)  =  I  -  e  '  -  vc  ' 
and 

(3.8)  // (a- )  =  .vc  '//(/)  +  H\(x)  -  e  '  f  HU  -  x  +  u)  du. 

3.2  Uniform  Demands 

For  the  uniform  distribution  on  [0,  cl  with  c  >  /  they  simplify  to 


(3.9) 
and 

(3.10) 


// 1  (v)  =  exp 


//(.v)  =  -exp 
c 


!(/-.v> 

i  -  * 

exp 

i/ 

C 

C 

r 

-  (/  -  A) 


f  ' 


exp 


"71 


mu)  du 


+  —//(/)  -  —  exp 

i/ 

1  -  £  j 

r  exp 

1 

-  —  u 

c  c 

c 

c  J 

0 

c 

H(u)  du 


+  ll  -  //(/)!  /FU). 


for  a  $  /  Similar  expressions  hold  for  v  >  /,  but  they  are  unimportant  in  the  present  con¬ 
text 


liquations  (3.6),  (3.8)  and  (3.10)  do  not  seem  to  yield  explicit  answers.  As  a  result,  we 
have  solved  (3.8)  and  (3.10)  numerically  by  iteration  using  the  system  of  equations 

<3  II)  //., . ,  <  a  )  =  vc  '//„(/)  +  HAx)  -  e  '  J]  //„(/  -  v  +  u)du 

with  // 1  as  in  (3.7)  and 


(3  12) 


1 

-  (  /  -  A  ) 

C 1  ' 

l 

—  exp 

In  CXP 

- a 

t 

C 

l' 

//„  ( u )  du 


+  —  //„(  /  )  —  —  exp 


+  [I  -  //„(/)!  HAx) 


1 

v 

c 1 

1 

1  / 

1  -  - 

1  cxp 

—  —  u 

c 

l 

(l 

c 

//„  ( 1/ )  du 


with  // 1  as  in  (3.9)  For  the  cases  carried  out  the  convergence  is  rapid;  after  n  =  5  iterations, 
very  little  change  is  noted  and  convergence  has  occurred  for  most  practical  purposes. 
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Next  lei  )„  he  (he  amount  of  storage  space  user!  in  t he  //ill  bin,  the  distribution  of  is 
denoted  by  G'.A.x).  and 

G (  v )  =  lint  /'().,  S;  i !  -  lim  G.  i  \  ) 


is  the  long-run  distribution  By  probabilistic  arguments  and  (3  4) 

(3.13)  O'  ( v )  =  J  //  *  R(ih)  I  (/  -  i  >  +  11  -  //(/)]  J  R(Jv)  I  (I  y  I 

where  FU  -  y)  =  1  -  /(/  i )  and  the  long-tun  average  expected  capacity  of  a  bin  that  is 
actually  used  is 

I  =  J  v  (,'(,/\ I. 

for  the  case  in  which  /  is  exponential  with  unit  mean 

(3.14)  ,1  =  /  [1  //(/)!  II  c  ;|  <•  '  f  c'  //(vUv. 

•Ml 

lor  the  case  in  which  /  is  uniform  on  l<*.  *  1  with  c  -  / 


(3.15)  4  =  -2  J  IK  ii) tin  *  exp  -  /  J  e 

)//(/)[:/  <  exp 


exp 


-  ii 
c 


II  (l!  )  till 


I  -t  i  exp 


Numerical  solutions  were  obtained  for  I  qualions  (3  14)  and  (3.15)  by  first  computing  the  pro¬ 
babilities  //.  ( x ) .  n  ~  1.2.  .  .  10  iteratively  from  (3.7)  and  (3.11)  for  the  exponential 

demand  case,  and  from  (3.01  and  (3.12)  for  the  case  of  uniform  demands.  Our  technique  was 
simply  to  discreti/e  \:  \  ■=  ill.  Ii  -=  /,  V  \  being  the  number  of  x-values  at  which  //„ tv)  is 
evaluated  (values  of  V  from  200-1  200  were  utilized  in  order  to  obtain  two-significant  digit  accu¬ 
racy).  The  integrals  were  then  approximated  by  a  summation,  i.e.  Simpson's  rule.  Having  the 
values  of  II, Ax  )  it  is  possible  to  calculate  those  of  //  .  ,<\  ).  and  from  these  the  values  of 
lx)  and  (lie  mean  usage.  /■.'[)  I.  may  he  calculated  tu  numerical  integration  In  the  case  of 
exponential  demand  very  simple  upper  and  lower  bounds  were  obtainable,  such  bounds  were 
not  tight  enough  to  be  useful  for  the  uniform  case 


The  following  table  summarizes  the  numerical  results  We  have  compared  demand  distri¬ 
butions  that  result,  as  nearly  as  possible,  in  the  same  probability  that  an  initial  demand  on  an 
empty  bin  will  be  rejected  We  have  tabulated  the  expected  level  to  which  the  bin  is  filled.  It 
is  interesting  that  the  limiting  bin  occupancv  is  Do  when  a  uniform  demand  over  the  range  of 
the  bin  size  is  experienced  I  his  result  lias  been  obtained  analy  tically  by  Coffman  et  al.  |2|;  in 
that  paper  simple  am)  elegant  analytical  expressions  for  <;  and  //also  appear  for  this  case  The 
considerable  similarity  of  the  numbers  m  the  lows  ol  the  table  is  notable,  apparently  the  long- 
run  bin  occupancy  is  only  slightly  larger  than  is  that  of  the  first  bin.  and  the  occupancy  experi¬ 
enced  for  uniform  demand  is  only  slightly  larger  than  for  exponential  further  investigations  to 
examine  the  reasons  lor  this  insensitivity  would  seem  to  be  of  interest 


\(  KNOW!  KIH.MF  Vf  S 
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Expected  Fraction  of  Bin  Filled 
(/„  =  E[  K„]  4-  /) 

Rejection  Probability 

Exponential  Demand 

Uniform  Demand 

F(l) 

/ 1 

foo 

/. 

foo 

0.00 

— 

— 

0.76 

0.75 

0.05 

0.74 

0.75 

0.74 

0.74 

0.10 

0.69 

0.70 

0.72 

0.72 

0.15 

0.65 

0.66 

0.68 

0.69 

0.20 

0.60 

0.62 

0  64 

0.66 

0.25 

0.56 

0.58 

0.60 

0.62 
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ABSTRACT 

This  paper  considers  ihe  problem  of  modeling  the  reliahilitx  of'  a  repairable 
system  or  device  that  is  experiencing  reliability  improvement  Such  a  situation 
arises  when  system  failure  modes  are  gradually  being  corrected  h\  a  tesi-liv- 
tesi'lix  procedure,  which  ma>  include  design  changes  A  dwumte  rehabiliu 
model  lor  this  process  is  discussed  and  statistical  technuiues  arc  derived  for  es¬ 
timating  the  model  parameters  and  for  testing  the  gondness-uf-iii  in  observed 
data  The  rehabiliu  del  analv/ed  was  first  proposed  is  a  graphical  ici'himjuc 
known  as  Duane  p  hut  can  also  be  viewed  as  a  nonhomogeneous  Poisson 

process  with  a  particular  mean  value  function 


l.  INTRODUCTION 

Predicting  the  reliability  of  a  system  or  piece  of  equipment  during  its  development  process 
is  an  important  practical  problem.  Reliability  standards  are  often  a  major  issue  in  the  develop¬ 
ment  of  transportation  facilities,  military  systems,  and  communication  networks.  For  commer¬ 
cial  products  that  are  to  be  leased  and  maintained  in  a  competitive  marketplace,  system  reliabil¬ 
ity  estimates  strongly  influence  predicted  profitability  and  customer  acceptance.  When  consider¬ 
ing  a  system  that  is  modified  in  response  to  observed  failures,  most  classical  statistical  estima¬ 
tion  techniques  are  not  applicable.  This  is  because  the  system  reliability  is  improving  with  lime, 
while  most  statistical  techniques  require  repeated  samples  under  identical  conditions. 

A  frequently  used  graphical  model  of  reliability  growth  of  repairable  systems  is  known  as 
’’Duane  Plots,"  proposed  by  J.  T.  Duane  191.  This  model  is  based  on  the  empirical  observation 
that,  for  many  large  systems  undergoing  a  reliability  improvement  program,  a  plot  of  cumula¬ 
tive  failure  rate  versus  cumulative  test  time  closely  follows  a  straight  line  on  log-log  paper 
Several  recent  papers  present  applications  of  Duane  plots,  e.g.,  |4),  [9]  and  1 10).  Estimating 
the  parameters  of  the  Duane  model,  i.e.,  the  slope  and  intercept  of  the  straight  line  fit,  is  some¬ 
what  difficult  to  do  directly  on  the  graph  l5l.  Weighted  least  squares  and  regression  techniques 
are  sometimes  used  ( 191 ,  [101)  to  obtain  parameter  values. 

An  underlying  probabilistic  failure  model  that  is  consistent  with  the  Duane  reliability 
model  is  the  nonhomogeneous  Poisson  process  (NHPP)  whose  intensity  is  total  test  time  raised 
to  some  power.  (See  (7 ]  and  (8]l.  Assuming  the  sample  data  consists  of  all  the  individual 
failure  times.  Crow  (7)  derived  maximum  likelihood  estimates  for  the  Duane  model  parameters 
and  a  goodness-of-fit  test  based  on  the  Cramer-von  Mises  statistic  (Parien  1 1 2 .  p.  1431)  A 
more  general  NHPP  model  was  proposed  by  Ascher  and  I'eingold  1 1),  which  also  used 

*Now  with  Dept  of  hnpnccnng  I  cononm  S\ stems,  Si.mlonl  limetsitv.  Si.mliml.  (  \ 
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ihc  t'ramer-von  statistic  lor  goodness-of-lit  testing.  C'ritical  values  of  this  statistic,  how¬ 

ever,  must  be  obtained  by  Monte  Carlo  simulation  lor  each  sample  size.  Crow  [7,  p.  403]  cal¬ 
culated  and  tabulated  values  for  sample  sizes  up  to  sixty.  These  parameter  estimates  and 
goodness-of-fn  test  deal  effectively  with  Duane  model  applications  having  small  sample  sizes 
The  facts  that  all  failure  times  must  be  stored  and  the  goodness-of-fit  measure  must  be 
evaluated  by  simulation  make  this  approach  difficult  for  larger  sample  sizes.  A  recent  paper  by 
Singpurwalla  [13]  proposes  a  time  series  model  for  reliability  dynamics.  This  model  can,  of 
course,  be  applied  to  any  type  of  reliability  trend  data,  but  requires  data  tabulation  at  a  larger 
number  ot  time  stages  and  does  not  have  the  intuitive  appeal  of  the  Poisson  process  for  model¬ 
ing  failure  occurrences  in  certain  systems. 

Our  paper  develops  statistical  estimators  for  the  Duane  model  parameters  based  on  tabu¬ 
lating  the  number  of  failures  between  fixed  points  in  time.  This  approach  has  the  advantage  of 
using  "sufficient  statistics"  for  the  data  collection,  i.e..  the  dimension  of  the  data  does  not 
increase  with  sample  size  Parameter  estimates  are  obtained  by  maximum  likelihood  and  a 
goodness-ol -lit  test  based  on  the  l  isher  chi-square  statistic  is  derived.  This  test  has  the  advan¬ 
tage  that  chi-square  tables  are  readily  available  tor  all  sample  sizes  and  significance  levels.  The 
accuracy  of  the  chi-square  lest  decreases,  however,  as  the  sample  size  gets  small  Sample  sizes 
for  which  the  techniques  of  this  paper  apply  are  found  in  developmental  systems  that  experi¬ 
ence  frequent,  minor  failures  such  as  paper  jams  in  photo  copy  machines,  voltage  fluctuations 
m  power  supply  systems,  faults  in  semiconductor  manufacturing  processes,  etc.  The  last  sec¬ 
tion  of  this  paper  illustrates  the  application  of  the  estimation  and  goodness-of-lit  techniques  to  a 
representative  set  of  simulated  failure  data 

Regardless  of  how  the  parameters  of  the  Duane  model  are  obtained,  considerable  caution 
is  required  when  extrapolating  reliability  irends  beyond  the  observed  data  to  future  lime  points 
Major  breakthroughs  or  setbacks  in  the  reliability  improvement  program  may  cause  significant 
deviations  from  the  straight  line  projections  Some  users  recommend  reinitializing  the  model 
and  shifting  to  a  new  straight  line  fit  when  major  changes  in  the  program  occur,  Even  if  one  is 
uneasy  about  extrapolating  the  reliability  growth  model  to  estimate  future  reliability,  it  remains 
a  valuable  tool  for  obtaining  a  "smoothed”  estimate  of  current  system  reliability.  While  reliabil¬ 
ity  is  changing,  sample  sizes  at  any  point  in  time  are  not  sufficient  for  conventional  statistical 
estimation  techniques.  With  a  dynamic  reliability  model,  past  and  current  failure  data  can  be 
combined  to  obtain  estimates  of  current  reliability  based  on  lilting  all  observed  data. 

2.  THE  IH  ANF  MODE! 

The  Duane  model  states  that  cumulative  failure  rate  versus  cumulative  test  time,  when 
plotted  on  log-log  paper,  follows  approximately  a  straight  line.  More  precisely,  if  we  let  ,V((),r) 
represent  the  total  number  of  failures  observed  up  to  time  i.  we  have  that 

(2.1)  log l (0. / )/ z ]  =  -  h  log  t  4  a, 

where  the  lilted  parameters  are  a,  h  >  0.  The  relationship  is  meaningless  at  i  =  0  but.  as  most 
users  point  out  (|5|.|9)),  a  certain  amount  of  early  data  is  generally  excluded  from  the  lit 
because  it  is  influenced  by  factors  such  as  training  of  personnel,  changes  in  test  procedures,  etc 
Equation  (2.1)  therefore  implies  that 

Y(().r)/r  -  ai  h.  where  <«■  =  log  a, 

for  t  beyond  a  certain  point  It  should  be  emphasized  that,  in  all  applications,  time  / 
corresponds  to  cumulative  operating  time  or  test  time  Tor  the  results  of  this  paper  n  is  most 
convenient  to  write  the  Duane  model  as 


(2  2) 


\  (()./ ) 


n  i'1.  where  =  I  h. 
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For  a  fairly  diverse  set  of  observed  systems,  Codier  |5.  p.  460)  has  found  />  to  be  generally 
between  0.3  and  0.5,  corresponding  lo  fi  between  0.5  and  0.7. 

3.  AN  UNDERLYING  STATISTICAL  MODEL 

In  this  section  we  describe  a  statistical  model  for  the  failure  process  that  is  consistent  with 
assuming  that  the  observed  failure  daia  fits  the  Duane  model.  Suppose  the  probability  that  the 
system  fails  at  time  t  (strictly  speaking  in  a  small  interval  [i.i  +  dr)),  regardless  of  the  past,  is 
determined  by  a  hazard  function  hit).  That  is, 

hit)di  ~  Flthe  system  fails  in  the  interval [ r. /  +  dt)\. 

independent  of  its  previous  failure  and  repair  history.  The  expected  number  of  failures  in  any 
time  interval  |rlt/j)  of  operating  time  is  then  given  by  the  mean  value  function 

(3.1)  Mit\.  rO  =  f  h(t)dt. 

J'\ 

Furthermore,  it  can  be  shown  (See  Parzen  |12,  Sect.  4.2))  that  AM/,.  t:).  the  number  of 
failures  observed  in  some  future  time  interval  )/,,  /,),  has  probability  distribution 

(3.2)  P(,V(f,./:)  =  k)  =  ( l A/ ( r i .  /,))7A!)  exp{-.\/(r,.r:)|  A  =  0.  1,2 . 

In  addition  to  its  mathematical  convenience,  this  model  has  considerable  intuitive  appeal.  The 
simple  Poisson  process  has  been  used  successfully  to  model  the  failure  occurrences  of  many 
devices,  or  collections  of  devices  operating  in  series.  One  may  think  of  a  system  having  a 
nonhomogeneous  Poisson  failure  process  as  a  large  collection  of  simpler  devices  in  series,  with 
individual  device  failure  modes  being  gradually  removed  with  time. 

The  mean  value  function 

(3.3)  A/ ( r , ,  t:)  =  ai/if  -  /('  ),  where  «,  fi  >  0; 

corresponding  to  lt(t)  =  a  fit1'  ’,  is  of  particular  interest.  Crow  |7.  p.  4051  pointed  out  that  the 
number  of  failures  from  a  process  with  this  mean  value  function  will  approximate  the  Duane 
Model  by  observing  that 

log!  A/  ( / )/ r  1  =  logo  +  i(3  —  l )  log  t.  where  Mil)  —  MU),  t). 

This  means  that  system  failure  data  from  a  NlIPP  with  mean  value  function  Mil)  will  approach 
the  Duane  model  with  probability  one.  Conversely,  this  process  with  mean  value  function 
Mil)  is  the  only  model  with  independent  increments  that  approximates  the  Duane  model  in  a 
probabilistic  sense  for  sufficiently  large  sample  sizes.  We  will  not  give  a  proof  of  these  state¬ 
ments  but  refer  the  reader  to  Parzen  112,  ch.  4)  or  Donelson  1 8 1  lor  a  complete  discussion 

4.  SELECTING  A  STARTING  TIME 

The  Duane  reliability  model  and  the  expected  number  of  failt  res  in  Equation  (3  31  are 
both  nonlinearly  dependent  on  the  choice  of  the  time  origin.  That  is.  if  we  begin  observing 
failures  at  time  t  =  >  0  and  ignore  the  first  AMO.  /„)  failures  and  the  time  interval  |0.  /,,).  we 

do  not  obtain  the  same  parameters  «  and  13  by  filling  the  subsequent  data  Since  the  logarithm 
is  a  strictly  concave  function,  there  is  only  one  choice  of  that  can  give  a  straight  line  fit  to  the 
data  on  log-log  paper.  Specifying  the  operating  lime  o,  that  is  assumed  to  have  elapsed  before 
the  beginning  of  the  modeling  process  is  therefore  an  important  step 

Some  users  of  the  Duane  Model  ( ( 5 1 ,  ( 1 0)  >  suggest  reducing  the  cumulative  failures  and 
observation  lime  by  removing  early  data  to  obtain  a  straight  line  lit  1  his  is  done  graphically  by 


■K  I  \ 


>-  .  . -I'li'iimn  i he  Jala  until  a  straight  line  fit  is 

►  '  -  '•  ape  .a  i he  graph  nl  cumulative  failures  versus 

'  ■  Pending  (concave).  so  ii  is  not  hard  10 

.  .  .  .1 


■■  g-  'in-  i lend  on  log-log  paper  is  observed  before  the  noisy 
■  i  -  1  g  o  :n  Nhitied  luriher  10  the  right,  the  straight  line  shape  will 

<•  •  ■  1  ’  v  nil i>i  that  ean  be  said  in  this  ease  is  that,  for  i  greater  than 

' i )  ,  in.  nt. -tie I  I  he  siaiisueal  model  (3.3)  ean  still  be  applied,  how- 

i'vo  ”>■  *  ambei  ol  t.ulures  \(r  ./>  after  the  first  ,\' ( 0 . r , >  fit  the  NHPP 

VM!  '  'U ..  .1!'  .  i  .  '  .  v  ■  •  \f  1  f.*l  •  *  ' 
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ll  the  IHiaiie  mod is  applied  graphically,  the  user  ean  attempt  to  estimate  the  parameters 
o  and  0  bv  drawing  the  best  straight  line  through  the  plotted  points.  This  is  somewhat  tricky 
because,  with  cumulative  failure  data,  the  later  points  should  be  weighted  more  heavily  in 
determining  the  lit  This  section  describes  a  statistical  estimation  procedure  based  on  the 
MIPP  model  ot  the  latlure  process  We  consider  two  possibilities  for  collecting  and  recording 
system  failure  times  The  first  is  to  record  the  occurrence  lime  of  each  failure,  which  yields  a 

sequence  of  observed  times  T,,  7> .  7\.  This  case  has  been  analyzed  by  Crow  in  |7]  and 

the  maximum  likelihood  estimates  are  given  by 

(5.1)  a  *  =  .V//'C‘ 
and 

(5.2)  li*  =  ~.V/£  \og(TJTs). 

!  I 

A  goodness-of-fil  lest  corresponding  to  these  estimators  is  derived  in  [7]  and  critical  values  of 
the  error  statistic  are  tabulated  for  sample  sizes  2-60. 

If  large  numbers  of  failures  are  observed,  it  is  often  convenient  to  record  only  the  aggre¬ 
gate  number  of  failures  between  each  pair  in  a  sequence  of  fixed  lime  points  f„,  r, . i„.  In 

this  case  the  data  is  in  the  form  A',.  A. . A„.  where  A,  =  number  of  failures  observed  in 

the  interval  (/,  |,  r,).  Maximum  likelihood  estimates  and  a  goodness-of-fit  criterion  for  obser¬ 
vations  in  this  form  are  developed  in  the  next  few  paragraphs. 

Maximum  Likelihood  Estimates  for  the  Aggregated  Case 

We  first  calculate  the  likelihood  function  for  the  data  A t.  A; . A„,  given  the  time 

points  z.,.  rt . („  anJ  the  assumed  form  of  the  mean  value  function  in  Equation  (3.3).  The 

probability  of  A,  system  failures  in  the  interval  I r,  ,,  z, )  is  obtained  from  Equation  (3.2).  Since 
the  underlying  model  assumes  that  each  of  the  lime  segments  is  independent,  the  likelihood 
function  can  be  written  as  a  product  of  these  probabilities, 

(5.3)  I.Ut.fi)  =  n  Pl.Vfr,  I.  /,)  -  A,)  -  expi-M(f,,.f„)|  r,)]V/A,!). 

-i  i-t 


To  simplify  the  calculation  of  the  estimators,  we  take  the  log  of  Ha.  fi).  noting  that  max¬ 
imizing  the  log  will  yield  the  same  maximum  likelihood  estimates.  From  (5.3)  we  have 

log /.(«./))  =  -aiit'  -  lit )  +  £  A,  (logo  +  log(/,'*  -  if*  i  >1  -  £logA,l. 

r~l  <-l 


(5  4) 


F 
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Taking  the  partial  derivatives  (91og  L)/da  =  0  and  Olog  /.  >/d/3  =  0,  we  obtain  the  equations 
for  the  maximum  likelihood  estimates. 

a*  =  N/Uf-  11!').  where  ,V  =  ]T  V 

.  - 1 

„  "  log  r,  p ,  log  r,  |  log  r„  -  p„logr„ 

0  =  >  A,  — - - - . 

,7^  1  -  P,  1  -  P,i 

,  =  (r,  ,/ /, )y *.  i=1.2 . /i.  and  /»„  =  (r„/r,,)"". 

Equation  (5.6)  is  an  implicit  function  of  fi  *.  but  can  be  solved  iteratively  by  a  computer  algo¬ 
rithm  or  programmable  calculator,  because  the  right  hand  side  is  strictly  decreasing  in  fi  *  To 
verify  this  fact,  consider  any  two  limes  /.  /'  and  compute  the  derivative 

(5.7)  (d/00)[logr  -  r"  log  f  1/11  -  rf|  =  -  /"(log  7)7(1  ~  /")’.  where  /  =  r/r’. 

This  derivative  is  negative  and  decreasing  in  /  for  0  <  T.  fi  <  1.  The  derivative  of  the  sum  in 
(5.6)  is  a  sum  of  terms  involving  the  difference  of  the  derivative  (5.7)  evaluated  at  T  —  i ,  J t 
and  t0/t„.  The  fact  that  (5.6)  is  decreasing  in  fi  *  follows  from  the  fact  that  (5.7)  is  decreasing 
in  T,  i.e.,  its  largest  or  least  negative  value  occurs  at  T  =  rn/r„.  Therefore,  (5.6)  has  a  unique- 
solution. 

6.  GOODN ESS-OF-FIT  CRITERION 

This  section  describes  a  procedure  for  testing  the  goodness-ol-lit  of  the  observed  failure 
data  to  the  NHPP.  We  assume  that  the  parameters  or  *  and  fi  *  are  obtained  from  the  maximum 
likelihood  estimates  (5.5)  and  (5.6).  From  the  form  of  (5.5),  it  is  clear  that  the  estimate  a  *  is 
defined  in  such  a  way  that  the  total  number  of  observed  failures  K  always  equals  ihe  expected 
number  of  failures  for  the  lime  period  |r„.  („) .  That  is,  <t  *  is  defined  so  that 

n  =  0*)  =  a*uty -  in'). 

Therefore,  there  is  no  difference  between  the  observed  versus  predicted  tom  I  number  of 
failures.  The  goodness-ol-lit  measure  must  therefore  be  based  on  the  differences  between  the 
observed  incremental  failures  N /V-, . K„,  and  the  predicted  values 

(6.1)  £' |  A/,  |or  *.  0  * )  =  (*  *  ( l/f  *  -  if1  *  ) ,  (=1,2 . 

Assuming  the  estimate  (5.5)  is  used  for  <*  *,  the  likelihood  function  for  a  goodness-ol-lit 
statistic  will  he  expressed  only  in  terms  of  fi  *  Since  the  NHPP  has  independent  increments, 
the  probability  that  a  given  failure  occurs  in  the  interval  |i  , .  r  )  is  the  expected  number  ol 
failures  for  that  interval,  divided  by  the  loial  number  of  failures.  This  is  written  as 

(6.2)  /;,  =  />,(0*>  =  la  Mr,"*  -  r"',  )|/|a  Mr"*  r"*>l.  /  =  1.2 . n. 

where  the  a*  parameter  obviously  cancels  out  The  likelihood  function  for  a  set  of  observed 
failures  ,V ,,  ,\\ . given  /V.  is  therefore  the  multinomial 

\  \ ,  \ 

Pi  p;  ■  ...  p„  .  where  .V,  +  ,V.  +  .,.■*  V,  »  V. 


;V 

•V|.  .V . . v  , 


(5.5) 

(5.6) 
where 


which  depends  only  on  fi  *  The  parameter  a  ’can  he  regarded  as  a  scale  parameter  that  guaran¬ 
tees  the  model  will  lit  the  total  number  observed  of  failures  V 
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We  now  show  how  the  gooJness-of-lit  of  the  incremental  failure  data  can  be  measured  by 
the  fisher  chi-square  statistic 

(6.4)  *:  =  £  (ly  -  NP,):/NPl. 

i  - 1 

The  use  of  this  statistic  as  a  goodness-of-fit  measure  is  based  on  the  following  theorem,  which 
has  been  restated  in  the  context  of  this  discussion. 


THEOREM  6.1:  I.et  the  parameters  /» , ,  p:.  . . .  p„ .  with  Lp,  =  1 ,  be  functions  of  a  param¬ 
eter  fi  and  let  a  particular  value  fi'  be  determined  from 


(6.5) 


(I  = 


£  (A  'Jp,)(bpJbfi) 


la  -  a' 


Then  the  statistic  (6.4)  with  />,  =  p,{fi').  i  =  1.2 . has  approximately  a  chi-square  distri¬ 

bution  with  n  -  1  degrees  of  freedom  (*’("  -  1))  for  large  N.  The  proof  of  this  result  is  quite 
lengthy  and  can  he  found  in  (6,  pp.  424-4341. 


To  apply  this  result  to  our  particular  problem,  we  must  show  that  fi'  equals  the  estimator 
fi  *  defined  by  liquation  (5.6).  Using  p,(fi)  as  defined  in  Equation  (6.2).  and  differentiating 
with  respect  to  fi.  one  can  verify  that  Equation  (6.5)  reduces  to  Equation  (5.6).  Thus,  fi'  =  fi' 
and,  since  (5.6)  has  only  one  solution,  the  value  is  unique. 


The  chi-square  error  statistic  (6.4)  has  an  additional  intuitive  interpretation  for  this  appli¬ 
cation  Suppose  a  and  fi  are  the  "true"  parameters  of  the  underlying  nohomogeneous  Poisson 
process,  i.e.,  the  values  to  which  the  estimators  «  *  and  fi  *  must  eventually  converge  for  very 
large  sample  sizes.  Then  the  "true"  variance  of  the  number  of  observed  failures  in  I /,  , .  z, > . 
i.e..  the  limiting  value  for  the  sample  variance  of  a  large  number  of  observations,  is  given  by 

Var|.V.  |«, /S)  =  « ( tf*  \  -  tf1)  (=1,2 . n. 

Consider  *./(*)  =  £  (A',  -  K{N,\a  *.  f)  *|>7Var|  A',|<».  fi\. 


which  is  the  sum  of  square  errors  between  the  observed  and  estimated  failures,  weighted  by  the 
true  variance  for  each  of  the  lime  intervals.  Suppose  we  minimize  this  with  respect  to  a  *  and 
fi  *  b>  solving  (f)ir,rji»*>  =■  0  and  fdW/dfi')  =  0  If  we  then  substitute  our  "best  estimates", 
o  *  for  or  and  fi  *  lor  fi ,  these  two  equations  reduce  to  the  maximum  likelihood  equations.  (5  5) 
and  (5  6),  respectively  Birnbaum  [2,  p.  251-2)  also  points  that  if  we  minimize  the  chi-square 
statistic  (6  4)  with  respect  to  fi.  the  estimate  obtained  must  approach  the  estimate  fi’  that 
satisfies  (6  5)  as  the  sample  size  approaches  infinity. 

This  goodness-of-fit  criterion  measures,  in  effect,  how  well  the  observed  data  tils  a  NHPP 
with  mean  value  function  M(i),  where  fi  *  is  the  "best”  growth  parameter  for  the  observed  data 
If  the  x  ' In  -  I)  statistic  (6.4)  exceeds  the  critical  value  at  a  reasonable  significance  level,  such 
as  0.05  or  0.1,  the  model  should  be  rejected.  Since  Theorem  6  1  gives  only  an  asymptotic 
result,  it  is  important  to  discuss  the  sample  size  requirements  for  applying  it  Given  the  popu¬ 
larity  of  this  test,  there  has  been  considerable  experience  with  various  types  of  data  A  com¬ 
mon  criterion  is  that  ,V  and,  in  this  case  the  time  points  .  r„,  must  be  such  that 

A p  $  10  for  all  /.  (See  Birnbaum  [2,  p  248]) 
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7.  APPLICATION  EXAMPLE 

As  an  illustration,  we  wilt  determine  the  estimators  a  *  and  /0*and  apply  t  he  good  ness- of- 
fit  test  to  the  sample  data  in  Table  1.  We  assume  that  the  failures  of  the  system  were  only 
monitored  at  fixed  points  of  time  so  that  the  observed  data  consists  of  the  first  two  columns  of 
the  table.  These  data  points  were  generated  by  computer  simulation  with  failures  sampled  from 
a  NHPP  with  mean  value  function  M(t),  having  parameters  «  =  10.0.  fi  =  0.5.  f  ailure  times 
T |,  T,.  ...  from  this  distribution  can  be  generated  sequentially  from  a  set  of  random  samples 
U U . . .  from  the  uniform  distribution  by  means  of  the  transformation 

(7.1)  rM,  -  177*  -  <l/«)log  t/,*,]1'".  0,  /  =  ().  1.2 . 

TABLE  1 


Time  Interval 

Observed 

Failures 

Predicted 

Failures 

Standard 

Deviation 

Normalized 

Lrror 

1 

400  -  800 

63 

78 

8.8 

2.88 

2 

800  -  1 200 

63 

61 

7.8 

0.07 

3 

1200  -  1600 

54 

51 

7.1 

0.18 

4 

1 600  -  2000 

51 

46 

6.8 

0.54 

5 

2000  -  2500 

68 

51 

7.1 

5.67 

6 

2500  -  3000 

49 

46 

6.8 

0.20 

7 

3000  -  3500 

34 

43 

6.6 

1.88 

8 

3500  •  4000 

39 

40 

6.3 

0.03 

9 

4000 4500 

39 

38 

6.2 

0.02 

10 

4500  -  5000 

43 

36 

6  0 

1.36 

11 

5000  -  5500 

39 

34 

5.8 

0.74 

>2 

5500  -  6000 

36 

33 

5-7 

0.27 

13 

6000  -  6500 

28 

31 

5.6 

0.29 

14 

6500  •  7000 

22 

30 

5.5 

2.13 

15 

7000  -  7500 

35 

29 

5.4 

1.24 

16 

7500  -  8000 

32 

28 

5.3 

0.57 

17 

8000  -  8500 

22 

27 

5.2 

0.93 

18 

8500  -  9000 

19 

27 

5.2 

2.37 

19 

9000  -  9500 

19 

26 

5  1 

1.88 

23  25 

The  data  in  fable  I  was  used  to  obtain  maximum  likelihood  estimates  «  *  and  /)  *  from 
Equations  (5.51  and  (5.6).  This  was  done  by  calculating  various  values  of  the  right  hand  side 
of  (5.6)  as  a  function  of  (i  until  the  minimizing  value  /3  *  was  determined  to  two  decimal  places 
This  gave  fi*  *  0.52  and  «  *  =  7.97,  where  <*  *  was  determined  from  (55)  with/)*  =  052 


The  accuracy  of  /i  *  is  reasonably  close  to  the  correct  value  0  -  0  5.  hut  the  estimate  of  <>  ' 
is  off  by  more  than  20%.  Other  calculations  with  different  sets  ol  random  numbers  produced 
errors  m  both  directions  but  generally  resulted  in  an  <*  *  error  several  times  larger  than  the  ti  * 
error,  on  a  percentage  basis  This  seems  to  indicate  that  one  is  more  likely  to  estimate  slopes 
of  the  Duane  Plot  lines  accurately  than  to  estimate  the  intercepts  accurately  with  the  maximum 
likelihood  estimates  Naturally  ,  as  the  number  of  observation  points  in  I  able  1  is  increased 
the  estimates  become  more  accurate.  Accuracy  was  not  improved  much  b\  increasing  the 
number  of  time  points  from  20.  as  shown  in  the  table,  to  lot)  and  the  sign  ol  the  error  lot  a 
given  example  generally  did  not  change  as  the  number  ol  observation  points  was  increased. 
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while  holding  the  underlying  failure  points  fixed.  Bringing  the  estimate  a  *  to  within  5%  of  the 
correct  value  typically  required  300  to  500  observation  time  points  for  the  computed  examples 

To  illustrate  the  use  of  the  goodness-of-lit  test  we  calculate  the  chi-square  statistic  (6.4) 
for  this  table.  The  "Predicted  Failures"  between  the  various  time  points  are  given  by 

Mp.  =  <»*(','!*  -  1-1.2 . 19. 

The  normalized  error  terms  as  in  (6.4)  are  given  by 

( .v  —  \p  )  V  ( Mp. ) . 

The  sum  of  these  errors,  when  compared  with  a  \  <  18)  error  table,  is  less  than  the  critical 
values  25.99  and  2887.  associated  with  significance  levels  01  and  0.05.  respectively. 

For  many  applications  of  the  model  it  is  more  important  to  predict  the  number  of  failures 
that  will  occur  in  the  next  time  period  than  to  obtain  accurate  estimates  for  <*  and  /3  In  such 
cases  the  estimators  obtained  from  10-20  lime  points  appear  to  be  sufficiently  accurate.  This  is 
because  there  is  a  range  of  <».  /3  pairs  that  provide  almost  as  good  a  lit  to  the  observed  data  as 
the  optimal  ones  and  any  parameters  in  this  range  provide  a  satisfactory  predictive  model. 

To  illustrate  the  prediction  accuracy  of  the  estimates  fi  *  =  0.52.  a*  =  7.97  obtained  from 
Table  1,  we  generated  simulated  failures  out  to  40,000  time  units.  The  number  of  failures 
predicted  by  extrapolating  with  the  estimated  parameters  and  with  the  true  parameters  are  com¬ 
pared  in  Table  2  The  errors  in  predicting  failures  caused  by  inaccuracy  in  estimating  the 
parameters  is  much  less  than  the  random  errors  that  occur  due  to  stochastic  variations  of  the 
failure  process.  This  was  found  to  be  the  case  in  several  sim  ;ir  experiments. 


TABU  2 


Time  Interval 

Simulated 

Failures 

list  i  mated 
extrapolation 

True 

extrapolation 

Standard 

Deviation 

9500  - 

10. 000 

24 

25 

25 

5.0 

9500  ■ 

15.000 

235 

251 

250 

15  X 

9500  - 

20.000 

412 

443 

439 

21  0 

9500  - 

30. 000 

715 

7tih 

75’ 

11  5 

9500  - 

40.000 

999 

1041 

1025  1 

32  0 

H.  C  ONC  l.l  SION 

C  hoosing  the  fixed  time  points  between  which  to  tabulate  failures  is  mainly  a  question  ol 
engineering  judgement.  The  time  points  might  be  selected,  for  example,  to  correspond  to  mile¬ 
stones  in  the  reliability  development  program  The  parameter  estimates  and  goodness-of-lit 
tests  obtained  in  this  paper  and  those  obtained  by  C  row  are  essentially  complementary  with 
respect  to  various  applications  of  the  Duane  model  It  is  not  possible  to  determine  the  precise 
sample  size  at  which  one  approach  becomes  more  advantageous  than  the  other  Based  on 
experience,  the  chi-square  good ncss-of- lit  test  tends  to  reject  most  sample  data,  including  data 
that  lits  the  model,  when  sample  sizes  arc  too  small  Therefore,  rejection  of  the  model  by  the 
chi-square  test,  based  on  data  with  a  questionable  total  number  of  samples,  might  be  viewed  as 
inconclusive  and  the  more  accurate  test  developed  by  Crow  could  then  be  applied  For  large 
sample  sizes  that  have  at  least  10  failures  between  time  points,  the  chi-square  test  should  be 
accurate  and  is  computationally  easier  Data  that  fails  to  lit  the  NHPP  model  with  mean  value 
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function  M(i)  based  on  these  tests  requires  a  more  general  approach.  A  NHPP  model  with  a 
different  intensity  such  as  discussed  in  111.  or  a  less  constrained  model  such  as  [13)  might  then 
be  tested. 
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ABSTRACT 

An  approach  is  presented  lor  obtaining  the  moments  and  distribution  of  the 
optimal  value  tor  a  class  of  prototype  stochastic  geometric  programs  with  log- 
normally  distributed  cost  coefficients  It  is  assumed  lor  each  set  of  values 
taken  on  by  the  cost  coefficients  that  the  resulting  deterministic  primal  program 
is  superconsistent  and  soluble  It  is  also  required  that  the  corresponding  dual 
program  has  a  unique  optimal  point  with  all  positive  components  It  is  indicat¬ 
ed  how  one  can  applv  the  results  obtained  under  the  above  assumptions  to  sto¬ 
chastic  programs  whose  corresponding  deterministic  dual  programs  need  not 
satisly  the  above-mentioned  uniqueness  and  positivity  requirements 


I.  INTRODUCTION 

This  paper  is  concerned  with  deriving  the  distribution  and/or  moments  of  the  optimal 
value  for  a  class  of  stochastic  prototype  geometric  programs  in  which  a  subset  of  the  cost 
coefficients  are  lognormally  distributed.  The  ,nograms  are  assumed  to  be  superconsistent  and 
soluble  for  all  positive  values  that  can  be  taken  on  by  the  cost  coefficients.  It  is  also  required 
that  the  dual  of  a  program  has  a  unique  optimal  point,  8, ,  with  all  positive  components,  for  all 
possible  values  that  can  be  taken  on  by  the  components  of  the  cost  vector  c.  Such  programs 
include  soluble  programs  with  no  forced  constraints.  Also  included  are  superconsistent  soluble 
programs  whose  forced  constraints  are  nonredundant  (and  hence  active  at  optimality)  and 
whose  forced  constraint  gradients  are  linearly  independent  at  optimality,  for  each  positive¬ 
valued  cost  vector  c. 

The  class  of  problems  specified  above,  while  of  interest  in  themselves,  can  be  used  to 
obtain  the  distribution  and/or  moments  of  the  optimal  value  for  more  general  classes  of  sto¬ 
chastic  prototype  geometric  programs.  This  will  he  indicated  in  Section  6. 

The  distribution  and/or  moments  of  the  optimal  value  of  a  stochastic  program  will  be 

expressed  in  terms  of  the  density  function  of  a  vector  l.  \  (log  A',,,  log  A', . log  A',,)', 

where  log  denotes  the  natural  logarithm  function,  d  is  the  degree  of  difficulty  of  the  program, 
and 
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lot;  K  -  y  h  log  t  for  /  >  |l>.  |.  .  t/1 

.  - 1 

In  the  above  If,.  . .  .  .  cnV  is  ihe  vector  of  cost  coefficients  (where .  denotes  transpose*  and 

the  b'  '  are  constants  that  are  independent  of  the  c,. 

One  advantage  to  deriving  the  distribution  and/or  moments  of  the  optimal  value  in  terms 
of  the  density  function  of  1.  is  (hai  the  vec  ’  is  normally  distributed  when  the  stochastic  c 
are  jointly  lognormallv  distributed.  Furthern,  e,  under  certain  conditions,  it  is  reasonable  to 
expect  that  /  behaves  approximately  as  if  the  vector  of  stochastic  cost  coefficients  were  lognor¬ 
mallv  distributed  even  when  it  is  not.  More  precisely,  if  ihe  stochastic  cost  coefficients, 
tc  \i  t  /!.  are  positive-valued  and  the  variates  (log  c:|/  t  /|  are  independent  with  finite  means, 
variances,  and  third  order  absolute  central  moments,  then  one  can  apply  a  central  limit  theorem 

for  random  vectors  to  the  relation  /.  —  £  7.'".  where  /.  ‘  \  (/>  "'  log  i  .  b  log  i . h 

,-i  _ 

log  i  )'  (111  Thus,  under  the  above  conditions,  one  might  expect  that  /  tends  to  he  normally 
distributed  imnidcd  the  stochastic  c,  are  positive-valued,  strictly  ummodal.  continuous  vari¬ 
ates.  the  number  of  indices  in  /  is  "large"  in  comparison  to  J  +  ! .  and  no  partial  sum  of  J  ♦  1 
of  the  /  is  "excessively"  dominant  in  the  sum  for  L 

I  he  results  of  this  paper  should  be  of  interest  in  instances  where  the  operating  or  con¬ 
struction  costs  associated  with  a  contemplated  project  or  engineering  system  can  be  adequately 
approximated  as  the  optimal  value  of  a  stochastic  prototype  geometric  program  with  lognormallv 
distributed  cost  coefficients  In  such  cases  a  knowledge  of  the  distribution  function  and/or 
moments  would  he  useful  as  a  predictive  tool  in  financial  planning  for  instance,  if  the  distri¬ 
bution  function  of  the  optimal  value  were  known  one  would  be  able  to  predict  with  a  given  pro¬ 
bability  that  a  proposed  system's  operating  or  construction  costs  incurred  over  a  given  period 
would  lie  within  a  specified  set  of  limits 

To  reflect  the  uncertainty  as  to  the  future  costs,  c, .  that  will  be  encountered  during  the 
construction  or  operating  period  of  interest  a  cost  analyst  often  subjectively  chooses  a  distribu¬ 
tion  for  each  cost  c  Cost  analysts  have  frequently  found  families  of  positive-valued  random 
variables  that  are  continuous  and  strictly  ummodal  useful  lor  this  purpose  |9l  The  lognormal 
random  variables  form  a  two  parameter  family  that  meets  these  specifications.  Recall  a  random 
variable  V  is  said  to  be  lognormal  iff  log  V  is  normally  distributed  Properties  of  lognormal  ran¬ 
dom  variables  can  be  found  in  1 2 1 

Cost  analysts  are  most  often  concerned  with  the  distribution  of  values  of  c,  about  a  central 
value  and  not  with  tail  values  Thus,  an  analyst  who  wishes  to  utilize  the  present  results  might 
proceed  to  express  his  uncertainty  about  the  future  value  of  cost  coefficient  <■,  as  follows 

1  Assume  c  is  lognormallv  distributed  and  subjectively  choose  the  median  value  of  (  . 
denoted  by  i 

2  Specify  an  interval  ol  interest  about  Z  of  the  form  (W  'f,.  W  Z)  where  W  t  (I,  «>). 

3  Subjectively  choose  fi  f  •().  II  such  that  I  -  fi  reflects  one's  belief  that  c,  €  (W  1 2 3  /  . 

b  (  ).  i  e  .  the  more  confident  one  is  that  <  (  .  «  £  I  the  closer  I  -  fi  should  be  chosen 
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4.  Calculate  the  unique  value  of  the  standard  deviation  of  c,  that  is  consistent  with  (1) 
and  the  equation  /V(0/"l£,  <  c,  <  3,%,)  =  1  -  S,  where  Pr  denotes  the  probability  function 
associated  with  c,. 

Results  of  the  paper  do  not  require  that  the  stochastic  c,  be  independently  distributed. 
Thus,  for  every  pair  of  stochastic  cost  coefficients  c,,  c,  (i  ^  j)  the  analyst  may  subjectively 
choose  a  number  between  -1  and  1,  the  correlation  coefficient  p„  of  log  c,  and  log  cn  to  reflect 
his  opinion  as  to  the  interdependency  of  c,  and  cr  The  theory  allows  for  the  possibility  that 
Pij  =  ±1  (i.e.,  with  probability  1  c,  =  acf  for  some  constants  a  €  (0,  and  ft  €  (— <»,  «>)). 

In  Section  2  the  notation  used  in  connection  with  the  deterministic  and  stochastic 
geometric  programming  problem  and  its  dual  and  transformed  dual  is  presented.  Also  the  spe¬ 
cial  role  of  the  transformed  dual  program  in  obtaining  the  distribution  and/or  moments  of  the 
optimal  value  of  the  primal  program  is  indicated. 

Section  3  presents  and  discusses  the  assumptions  placed  upon  the  primal  program 
throughout  Sections  3  through  5  and  the  appendices.  Additionally,  useful  properties  of  the 
density  functions  of  L  and  /.  A  (log  K , . log  K,,)'  are  stated. 

In  Section  4  we  use  the  density  functions  of  L  and  L ,  together  with  the  maximizing  equa¬ 
tions  for  an  unconstrained  transformed  dual  program,  to  obtain  the  density  functions  of  rand 
(r,  v (/*,  )) .  Here  r  denotes  the  random  vector  of  the  optimal  point  of  the  unconstrained  sto¬ 
chastic  transformed  dual  program  and  v(P, )  denotes  the  optimal  value  of  the  stochastic  primal 
program.  We  then  obtain  the  density  function  of  v(P, )  as  a  marginal  density  of  (r.v(PJ). 

In  Section  5  we  use  the  density  function  of  r  to  derive  a  formula  that  expresses  each 
moment  of  v ( /*, )  as  the  integral  of  an  explicitly  given  integrand  over  an  explicitly  specified  con¬ 
vex  polyhedral  subset  of  R‘‘ ,  where  d  is  the  degree  of  difficulty  of  the  stochastic  primal  pro¬ 
gram. 


Section  6  briefly  indicates  how  the  preceding  results  can  be  used  to  calculate  the  distribu¬ 
tion  and/or  moments  of  v(// )  when  P,  need  not  satisfy  all  the  assumptions  of  Section  3. 

Appendix  A  contains  the  statement  and  proof  of  a  lemma  from  which  important  proper¬ 
ties  of  L  and  L  immediately  follow.  These  properties  are  stated  in  Theorem  1  of  Section  3. 

Finally,  in  Appendix  B  we  establish  that  boundedness  of  the  dual  feasible  set  is  a 
sufficient  condition  for  the  existence  of  all  the  moments  of  v(Pt).  under  the  assumptions  of 
Section  3. 

2.  NOTATION  AND  PRELIMINARIES 

We  shall  now  review  the  definitions  and  notation  used  in  connection  with  prototype 
geometric  programming  that  will  be  utilized  in  the  paper  In  the  following,  for  every  positive 

integer  v,  <t>  A  (1 . »>|  and  <?>  A  (0,  I . r).  Also,  for  every  matrix  P,  P'  denotes 

the  transpose  of  P  All  elements  of  Huclidean  rt-space.  R",  will  be  viewed  as  column  vectors  of 
n  real  numbers  and  the  zero  vector  will  be  denoted  by  u. 

Recall  a  prototype  primal  geometric  program  has  the  following  form  |4):  inf  > ?„(/)  subject 
to  tfK(t )  <  1  V  k  e  <  p>  and  /,  >  0  V  /  €  <  m>  where  t  =  <  r, . and  .eK  </ )  A  £  c, 
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f|  t,  for  «€</)>.  In  the  above  A  =  (a„)  is  an  n  by  m  matrix  with  real  entries  called  the 

/-i 

exponent  matrix  and  c  =  (c, . c„)'  is  a  vector  of  positive  numbers  called  the  vector  of  cost 

coefficients.  Also.  JK  A  I  w„  .w* .  i . where  =  1,  mH  =  nKl  +  1  for  k  6  <p>,  and 

np  =  n.  The  constraints  gK(t)  <  1  are  called  forced  constraints  and  we  allow  the  possibility  that 
a  primal  program  has  no  forced  constraints. 

In  this  paper  we  shall  be  concerned  with  problems  of  the  above  form  when  some  or  all  of 
the  cost  coefficients  are  stochastic  variables  that  are  lognormally  distributed.  Thus,  we  shall 
assume  there  exists  /  C  <n>  such  that  /  ^  <t>  and  i  €  /  iff  c,  is  stochastic.  Let 

i’i  A  (c, . c,  )'  where  /,  <  ...  <  iw  and  /  =  { I  v  €  <<u>).  Thus,  17  is  a  random  vector 

formed  from  the  stochastic  cost  coefficients.  Values  taken  on  by  c ,  will  be  denoted  by  17.  Also 
c  will  denote  the  value  taken  on  by  cost  coefficient  vector  c  when  0,  takes  on  the  value  c,.  We 
shall  let  P  and  P  denote  the  corresponding  stochastic  and  deterministic  prototype  primal 
geometric  programs  Furthermore,  v(  P  )  will  denote  the  optimal  value  of  P  and  v(  P ')  will 
denote  the  stochastic  variable  that  takes  on  the  value  v(  P  )  when  <7  takes  on  the  value  c,. 

The  stochastic  program  P,  is  not  convenient  to  work  with  due  to  possible  randomness  in 
coefficients  of  the  forced  constraints.  To  find  computationally  tractable  bounds  on  the  solution 
of  a  two  stage  geometric  program  with  stochastic  cost  coefficients,  Avriel  and  Wilde  [3 ]  con¬ 
sidered  the  stochastic  problem  D  in  place  of  P,  where,  for  every  c  >  0,  D  is  the  dual  of  P  as 
given  in  14].  The  stochastic  program  D,  has  the  attractive  feature  that  all  its  randomness  is 

confined  to  the  objective  function.  To  see  this  recall  D  is  the  following  program:  sup  ]~] 

1 

I  )  ,,‘l 

If, /ft/  fl  '  subject  to  the  normality  condition  £  ft,  =  I.  the  orthogonality  conditions 

an  ft,  =  0  for  /  €  <»/>.  and  the  positivity  conditions  ft,  ^  0  for  /  €  <n>  In  the  above. 
,  ■  1 

for  every  k  €  </>>.  A, (ft)  A  ^  ft,  for  ft  €  R"  Also,  in  evaluating  the  dual  objective  function 

~  "A 

one  uses  the  convention  that  v'  =  x~'  =  1  for  x  =  0.  When  Z5  has  no  forced  constraints  we 
set  p  -  0  and  define  the  expression  (S )A“ fi'  to  be  1. 

«-i 

Under  rather  general  conditions  one  has  v(A>  )  =  v(  P)  for  c  €  R",  [4,  Ch.  6]  (where  R 
denotes  the  positive  orthant  of  /?"and  v(D  )  denotes  the  optimal  value  of  D  )  This  is  iruc. 
e  g,  if  P  is  superconsistent  and  soluble  [4,  Ch.  4],  Thus,  frequently  the  distribution  function 
of  v(/)  )  will  be  the  same  as  that  for  v(P, ),  where  v(D, )  denotes  the  stochastic  variable  that 
takes  on  the  value  v ( />  )  when  <7  lakes  on  the  value  (7.  Obtaining  the  distribution  function 
and/or  moments  of  v(D  I  is  facilitated  by  the  fact  that  the  constraint  region  for  Z),  is  a 
polyhedral  convex  set  that  depends  only  on  the  nonstochastic  exponent  Matrix  A 

Instead  of  working  directly  with  D  we  shall  use  the  transformed  dual  program.  D  ,  con¬ 
sidered  in  [4,  Ch.  3).  Recall  /)  is  obtained  from  D  by  solving  the  normality  and  orthogonality 
constraints  of  l) 

In  what  follows  we  shall  assume  without  loss  of  generality  that  the  rank  of  A  is  wand  that 
q  t  R’!  is  not  in  the  column  space  of  A,  where  </,  =  1  if  /  <  nn  and  q,  =  0  if  1  >  n,,  (sec  14. 
Ch.  3)1  As  in  14)  we  define  d  to  be  the  dimension  of  the  solution  space  of  the  system  of  equa¬ 
tions  I 'ft  =-  (_).  q  'ft  =  0  (Recall  d  is  called  the  degree  of  difficulty  of  P  and.  under  the  above 
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assumptions,  equals  n  -  m  —  1.)  Throughout  the  paper  we  assume  d  >  0.  (The  distribution 
problem  for  v(Pt.)  when  d  =  0  has  been  studied  by  R.  Stark  in  [13].)  In  accordance  with  the 

terminology  in  [4],  we  define  N  A  |  j  6  <d> }  to  be  a  nullity  set  for  Pt  iff  /Vis  a  basis  for 

the  solution  space  of  the  above  homogeneous  system  of  equations.  Also  bm  €  R"  is  called  a 
normality  vector  for  Pt  iff  A'b 101  =  0  and  q‘bW)  =  1. 

Let  /V  A  j  €  <t/>]  be  any  nullity  set  and  bu,‘  be  any  normality  vector  for  Pt 

—  j 

Note  8  €  R"  satisfies  the  orthogonality  and  normality  conditions  for  D  iff  5  =  b,ni  +  £  r.b'  ' 

1 

where  the  r;  6  /?'  are  uniquely  determined  by  S.  Thus,  by  replacing  5,  in  D  by  +  X 

.  - 1 

r,b,Ui  we  obtain  the  equivalent  transformed  dual  problem  D  : 

sup  K(c.bm)  YlKic.b11')  ns,(r)  Sl<''  |7XK(r)A“'r’ 

'  1-1  l~ I  ««l 

subject  to  the  positivity  constraint  Rr  >  —b""  where  r  A  (r, . r^Y  (the  vector  of  basic 

variables).  In  the  above  { K  (c.b'1 ')  I  j  €  <d> )  is  called  a  set  of  basic  constants  for  P 

_  «  b  ' '  ’  - 

(corresponding  to  /V  and  b,m)  where  K(c,b>,<)  A  c,  for  /  €  <d>.  Also.  B  is  the  why  d 

~ 

matrix  whose  yth  column  is  b'1'  for  j  6  <d>.  Finally,  for  i  €  <n>  and  k  €  </>>. 
8 , (r )  A  +  £  r,b!''  and  A*!/-)  A  £  8,(r).  When  P  has  no  forced  constraints  we  define 

/-i  />./„ 

A  „(/•)*’' 1,1  to  be  1. 

K  -  I 

Note  that  the  parameters  in  /)  depend  on  the  choice  of  nullity  set  A  and  normality  vector 
/>"”.  However,  as  v < />  )  =  v(/>  I  for  c  €  (where  v < A>  )  denotes  the  optimal  value  of  /)  1. 
the  optimal  value  of  /)  is  independent  of  the  choice  ol'  V  and  Thus,  for  any  nullity  set  A 
and  normality  vector  h""  for  /> ,  the  distribution  function  of  v<D, )  is  identical  to  the  distribu¬ 
tion  function  of  v(£> ),  where  v ( /),  )  is  the  stochastic  variable  that  takes  on  the  value  v(/)  ) 
when  c/  takes  on  the  value  c/. 

To  obtain  the  distribution  function  and/or  moments  of  v(Z), )  we  shall  first  obtain  the 

density  function  of  the  random  vector  /.  A  (/.„./. , . /.,,)’  and  L  A  ( , . L,,Y  where. 

for  /  €  <d>,L,  is  the  random  variable  that  takes  on  the  value  log  A'  (?,/>''  ’I  when  c/  takes  on 
the  value  ?/. 

3.  On  the  Density  Functions  of  /.  and  L 

Unless  otherwise  stated,  throughout  the  remainder  of  the  paper  we  shall  assume  the  fol¬ 
lowing: 

(1)  \e, \v  €  <  u  >  |  is  a  set  of  positive-valued  random  variables  such  that,  for  eve  y 

"  ii 

i  6  <n> .  c  =  «,  e,"  lor  some  « ,  t  (0,  °°)  and  R ,,  d  (-00,00),  (>  d  <  u  > .  lurther- 

.  - 1 

more,  it  is  assumed  that  (log  c,.  ..  .  log  <■„)'  is  a  nondegenerale  normal  random  vector  with 
mean  vector  m  =  1 . ;|r>d  dispersion  matrix  \: 

(2)  7  here  exists  a  nullity  set  (A1 ' '1/  €  <r/>|  for  P  such  that  { s' ' '|./  €  <</>]  is  linearly 
independent  where  s1'1  A  R'h11'  for  i  (  <d>  and  R  is  the  n  *  11  matrix  whose  ( 1.  /)  entry  is 

R  . 
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(3)  Foi  c..  .  -,.lue  Z'i  that  c,  takes  on  the  program  P  is  superconsistent  and  soluble; 

(4)  For  every  value  ct  that  c,  takes  on  the  program  D  has  a  unique  optimal  point  8(  and 

8,  >  0. 

t 

Many  of  the  results  obtained  under  the  above  restrictions  form  the  basis  to  approaches  for 
calculating  the  distribution  function  and/or  moments  of  v(P, )  under  less  restrictive  assump¬ 
tions.  This  will  be  briefly  indicated  in  Section  6. 

Assumption  (1)  allows  for  the  possibility  that  a  cost  coefficient  c,  is  constant  (fin  =  0  for 
all  v  €  <u>).  Also,  (1)  permits  one  to  conveniently  work  with  a  vector  of  stochastic  cost 

coefficients  c,  =  (c . c,  )'  for  which  (log  cl{ . log  c,  )'  is  a  degenerate  normal  random 

vector.  Degeneracy  would  occur,  e  g.,  if  C)  and  c2  where  components  of  ct  such  that  o  =  acf 
for  some  a  6  (0.  °°)  and  (i  €  Rl. 

To  evaluate  the  mean  p.,  and  variance  <r;  of  log  e,  a  cost  analyst  could  apply  steps  (I) 
through  (4)  of  Section  1  to  e,  in  place  of  c,.  After  choosing  f ,,  the  median  of  e,,  and  the  vari¬ 
ance  of  e,  by  these  steps  the  values  of  /x,  and  <j  ;  can  easily  be  calculated  [2], 

Note  Assumption  (2)  is  satisfied  if  u  -  n  and  c,  =  e,  for  every  /  €  <u>.  Also,  if  there 
exists  a  nullity  set  of  P,  that  satisfies  (2)  then  every  nullity  set  of  P,  satisfies  (2)  (Proposition 
1). 

Recall,  for  F  €  R",,  P  is  called  superconsistent  iff  there  exists  t  €  R"'  such  that  r  >  0 

m  tl 

and  i  ?,  n  »;■'  <  1  for  every  k  €  <p>.  Also,  P  is  called  soluble  iff  P  has  an  optimal 

nj,  (- 1 

point.  It  can  easily  be  shown  that  P  is  superconsistent  for  all  F  €  P>  iff  there  exists  a  linear 
combination  of  the  columns  of  A ,  say  .v,  such  that  x,  <  0  for  all  i  €  T„,  k  €  <p>  [1,  p.  329], 
Alternately,  one  can  show  that  P  is  superconsistent  for  all  F  €  P>  iff  the  set 
{8  €  P"|  8,  >  0V  i  €  <n>.  A' 8  =  0,  and  q' 8  =  1)  is  bounded  [1,  p.  329],  Moreover,  if  the 
above  set  is  bounded  and  contains  a  point  8  >  0  then  P  will  be  superconsistent  and  soluble  for 
every  F  €  P>  (by  [4.  p.  120,  Th.  21  and  [1,  p.  329)). 

Assumption  (3)  implies  that  v(P  )  =  v(D  )  for  every  value  c,  taken  on  by  c,  ([4,  p.  117, 
Th.  11). 

Assumption  (4)  holds  for  F  €  P>  if  P  is  soluble  and  has  no  forced  constraints.  More 
generally,  one  can  show  (4)  holds  at  F  €  /?>  if  P  is  a  superconsistent  soluble  program  whose 
forced  constraints  are  nonredundant  and  whose  forced  constraint  gradients  are  linearly  indepen¬ 
dent  at  optimality.  By  nonredundant  we  mean  that  the  optimal  value  of  P  is  greater  than  the 
optimal  value  of  PK  for  every  k  €  <p>,  where  PKi  denotes  the  program  obtained  from  P 
by  deleting  forced  constraint  k. 

If  the  components  of  L  form  a  set  of  independent  random  variables  then  we  obtain  a 

d 

simpler  formula  for  the  density  function  of  L  since,  in  this  case,  #(/, . Ij)  =  n  *,</,> 

;-l 

where  g  is  the  density  function  for  L  and  g ,  is  the  density  function  for  If,  in  addition,  La  is 
independent  of  the  components  of  L  then  the  calculation  of  moments  of  v(P, )  is  simplified. 
This  follows  from  the  fact  that  one  can  express  v(P,.)  as  the  product  e  "a»(r)  where  u»  is  a 
known  function  cr  a  </-dimensional  random  vector  r  whose  density  function  can  be  calculated 


OPTIMAL  VALUE  OF  STOCHASTIC  GEOMETRIC  PROGR  AMS 


555 


from  that  of  L.  Thus,  when  L0  is  independent  of  L  we  have  Ev(v(Pl))  -  |£‘(e  "']  |£Miu(r))] 
where  E"(Q)  denotes  the  i<th  moment  of  random  variable  Q  (whenever  this  moment  exists) 
If  L0  is  a  linear  function  of  the  components  of  L  orie  can  obtain  a  function  w  of  r  such  that 
v(Pc)  -  ai(r)  from  which  one  can  calculate  E',{\(PC)).  It  will  be  shown,  under  the  previously 
listed  assumptions,  that  one  can  always  find  a  nullity  set  \b{j)\j  €<d>)  and  normality  vector 
b(0)  for  Pc  such  that  L0  is  independent  of  L  if  s(01  9  span  {s0,|y  €<d>)  and  £0  is  a  linear 
function  of  the  components  of  L  if  s<0)  6  span  {S^’ly  6  <d> }  where  s(01  A  p 'ft10'  and  ft101  is 
any  normality  vector  of  Pc. 

Theorem  1  indicates  how  to  obtain  a  nullity  set  for  £,,  [bu'\j  €  <d>),  such  that  the 
components  of  the  corresponding  random  vector  L  are  independent  normal  variates  whose 
means  and  variances  are  known.  Also,  using  the  above  nullity  set,  it  is  shown  how  to  obtain  a 
normality  vector  for  £,,  b{0> ,  such  that  if  s<01  9  span  (s(,l|y  €  <d> }  then  the  components  of 
the  corresponding  random  vector  L  are  independent  normal  variates  whose  means  and  vari¬ 
ances  are  known.  The  proof  of  Theorem  1  follows  immediately  from  Lemma  A  which  is  stated 
and  derived  in  Appendix  A.  The  proof  of  Lemma  A  uses  the  eigenvectors  of  the  dispersion 
matrix  A.  Fortunately,  however,  the  calculation  of  the  above-mentioned  nullity  set  and  nor¬ 
mality  vector  and  the  calculation  of  the  means  and  variances  of  the  corresponding  variates  £,, 
j  €  <d>,  do  not  require  any  eigenvector  or  eigenvalue  calculations. 

THEOREM  1.  (a)  Define  lbij)\j€  <d> }  inductively  by  bn>  A  bn>  and,  for 

1  <  j  <  d,  bij)  A  b{))  -  £  «p'bu\  p  7>'n  >  A)~‘  (<p'bij  ’,  p'b,n>Jbu\  where 

/-i 

<XT’>  \  A  x'Ay  for  (jc,.y)  €'R“x  Ru.  Then  {h(''ly  €  <d>)  is  a  well-defined  nullity  set  of 

Pc- 

(b)  Define  b(01  A  h(0'  -  £  «P'bu\  P'b{l'>  0  ~'  (</3 'bu\  ir  swi  9  span 

/-i 

{.s</>| j  €  <  </>  |,  b(0>  A  bm  otherwise.  Then  b,0)  is  a  well-defined  normality  vector  of  P(. 

(c)  For  every  j  6  <d>  let  £;  denote  the  random  variable  that  takes  on  the  value  log 

Kl(c,b{,))  when  c,  takes  on  the  value  c,.  Also,  define  L  A  (Lt . Ld)'  and 

L  A  (Ln,L\,  ,  L,<Y.  Then  L  is  a  normal  random  vector  with  independent  components. 

Additionally,  £  is  a  normal  random  vector  with  independent  components  if  v  9  span 
{s,nlj  €  <«/>). 

(d)  For  every  j  €  <d>  let  g,  denote  the  density  function  of  L,.  Then  g, (/)= 
(t),n/27t)_i  exp  (—  (2tj ,2) “ 1  (/  -  v,)2)  for  every  /  €  /?',  where  v,  is  the  expected  value  of  L ,  and 

n 

t)2  is  the  variance  of  Lr  Furthermore,  v,  =  <p.,/3'h,'l>  -  £  a,b/''  and  t),  =  <p'b''\ 

i-i 

P'b{,>>  \  for  every  j  €  <d> ,  where  <  •,  >  denotes  the  usual  inner  product  on  R“. 

Throughout  the  remainder  of  the  paper  we  define  b{"  and  L ,  for  j  €  <d>,  L ,  and  L  as 
in  Theorem  1.  We  also  denote  K^c.b*1 ')  by  K,(c)  for  /  €  <d>. 

We  shall  now  show  that  if  there  exists  a  nullity  set  of  P,  that  satisfies  Assumption  (2) 
then  every  nullity  set  of  £,  must  satisfy  (2). 

PROPOSITION  1:  If  Assumption  (2)  holds  then  for  any  nullity  set  €  <d>)  of 

P,  the  set  Is1'  ’I./  €  <  d> )  is  linearly  independent  where  s1' 1  A  P'bu  ’  for  every  j  €  <d> 
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PROOF;  Lc,  ..  '|y  E  <d>]  be  a  nullity  set  for  Pe  such  that  ls(y’|y  E  <d>)  is  linearly 

independent  where  s(J  '  A  /3'6<y  ’  for  every  j  €  <d>.  Let  B  A  span  {ft^’ly  E  <</>)  and  S  A 
span  {sij)\j  €  <d>).  Define  T  to  be  the  unique  linear  transformation  for  B  to  S  such  that 
T(b{J))  —  siJ)  for  every  y  €  <rf>.  Thus,  since  /3'6<y)  -  /s(y)  for  every  j  €  <</>,  one  has 
T(6)  -  07>for  all  6  €  B. 

Since  {ft0)|y  €  <d>\  is  a  nullity  set  for  Pc  one  has  span  {i0)|y  E  <d> )  -  B.  Thus, 
for  every  j  €  <d> ,  7’(6t/>)  —  /3'6(y)  —  s0>.  Note  T  is  an  isomorphism  from  Z?  onto  5  and 
(ft<J  1y  €  <d>)  is  linearly  independent.  Hence,  (s^’ly  E  <d>  \  is  linearly  independent. 

Next  we  consider  the  assumption  s(0)  S'  span  {.v,y,|y  E  <d>\. 

PROPOSITION  2:  Assume  S(01  ?  span  {s(y)|y  €  <d>).  Then  for  any  nullity  set 
{b{,)\j  6  <d>  |  and  normality  vector  bt0)  for  P,  one  has  st0>  9  span  { v  0 )  ly  €  <d>\  where 
s(j  1  A  /3'6('  ’  for  every  y  6  <d>. 

PROOF:  Define  B  A  span  [biJ)\j  €  <d>  j  and  S  A  span  {s</,|y  6  <</>].  Let  f  be 
the  unique  linear  transformation  from  B  to  S  for  which  7'(/>('))  —  s<J  1  for  every  y  £  <d>. 
Since  /3'6( ' 1  =  s(/ 1  for  every  y  E  <  </>  one  has  Tift)  =  p'b  for  all  b  £  B. 

Since  {6</,|y_€  <<2>1  is  a  nullity  set  of  Pc  one  has  span  |6<y,|y  £<d>)  C  B  Also, 
since  b101  and  b(0>  are  normality  vectors  of  P ’.  it  follows  that  b,0>  -  b101  E  span 
{ft^’ly  E  <</>},  i.e.,  ba))  E  B.  Thus,  for  every  j  E  <d> ,  ft1'*  E  B  and  hence,  T(b{>')  - 
/3'b(l  1  =»  S*'  \  Finally,  observe  T  is  an  isomorphism  from  B  onto  S  since  sl0)  ?  span 
{s(,)l j  €  <d_>\  and  {.s<,’|y  E  <d> )  is  linearly  independent  by  Assumption  (2).  Moreover, 
lh('l|y  €  <d>]  is  linearly  independent.  Thus  {.?<,,|y  E  <d> )  is  linearly  independent. 


As  mentioned  earlier,  when  u  =  n  and  c*  =  e,  for  every  /  £  <u>  then  Assumption  (2) 
holds.  In  addition  one  has  st0)  span  (s<y,|y  E  <d> }  and  hence  by  Theorem  1  the  com¬ 
ponents  of  L  are  independent.  We  next  consider  the  case  where  Assumption  (2)  holds  but 
sl0>  E  span  (s('  ’|y  E  <d>\. 


PROPOSITION  3:  Assume  s(0)  E  span 

d 

j  E  <d>  such  that  s(01  —  £  yjSij>  where  s 

/-i 

d  n 

L0  -  £  +  D  where  D  is  the  constant  £ 


{v<,)!y  E  <d>}.  Then  there  exist  yt  E  R]  for 
(l)  AB'b(']  for  every  j  E  <d>.  Furthermore, 


log  a,. 


i-t 


PROOF:  Since  v(0)  E  span  {s(y,|y  E  <d>),  by  Proposition  2  there  exist  v,  E  R1  for 

a  rf 

j  E  <d>  such  that  s’01  —  £  y;s(y\  Thus  s/01  —  £  y,s,U)  for  every  i  E  <u> . 

i-\  /-i 


f0> 

"”))) 

n 

By  Lemma  A,  Part  (iii),  L0 

A  log  Afn(c)  - 

log 

(exp(L  (e,s 

-  I  ft,101  log 

U 

n 

U 

1  J 

/  “■  i 

i 

n 

/"  1 

d 

a,  +  £  log  e.-Y.  b,m 

1  1 

log  a,  +  X 
,  _  1 

log  f,-  £ 

ft/0’ 

log  a,  +  £ 

U 

n 

f™  1 

d 

n 

/“  1 
d 

y,  £  s,tn  log  e, 

-  I  6,,m 

log  a,  +  X 

y,L  (e,s  '  ) 

- 

I  */m 

log 

a,  +  L  -v< 

/-l 

/-i 

/-] 

/-l 
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Lj-  ±  b,tj)  log  a, 


i-1 


£  y,L,  +  D  where  Oi  £  ft,""  log  a,  -  £  v, 
7-1  ”  <-i  ,-\ 


£  ft,1' '  log  a, 

i-l 


br-Y.yXj) 

>- i 


log  a,. 


We  next  consider  the  (/-dimensional  random  vector  r  A  (r, . r,,)’  mentioned  earlier 

whose  density  function  can  be  used  to  obtain  moments  of  v ( /*, ) .  To  define  r  assume  <7  takes 
on  the  value  Z7.  Then,  by  Assumption  (4),  Z>.  has  a  unique  optimal  point  8  .  Since 
(ft(j)|y  €  <d> )  is  a  nullity  set  and  ft<0)  is  a  normality  vector  of  Pk  there  exists  a  unique  point 

j 

r-  A  (/■)(?),  ....  rtl(c))'  €  RJ  for  which  8(  =  ft"11  +  £  ( r, (c ) ) ft1 '  ’.  We  define  r  to  be  the  ran- 
”  /-i 

dom  vector  that  takes  on  the  value  r-  when  ct  takes  on  the  value  c/.  In  the  next  section  we 
shall  obtain  the  density  function  of  r  from  that  of  L.  Also,  when  v""  ?  span  { s * ' 1 1 >  €  <d>\. 
we  shall  obtain  the  density  function  of  v(P()  as  a  marginal  density  of  (v(PL).r).  The  density 
function  of  ( v (/*,), r )  is  obtained  from  that  of  L. 


4.  THE  DENSITY  FUNCTIONS  OF  rand  v(/>f) 


Assume  C/  takes  on  the  value  ch  Since  8  is  an  optimal  point  for  D  with  all  positive 
components  (Assumption  (4))  it  follows  that  hc  satisfies  the  maximizing  equations  for  D  [4,  p 
88,  Th.  3],  Expressing  8-  in  terms  of  the  components  of  r  ,  the  maximizing  equations  can  be 
written  in  the  form  log  A,(c)  =  ft,(r)  for  every  j  £  <d>  where,  for  j  €  <d>,  ft,  is  the 
function  defined  in  Theorem  2.  The  above  equations  will  be  used  to  obtain  the  density  func¬ 
tion  of  r  from  that  of  L.  From  the  above  maximizing  equations  one  can  easily  show  that  the 
optimal  value  of  P  satisfies  the  equation  log  A'(l(c)  »  log  (v(P  ))  +  ft„(r  )  where  ft(l  is  defined 
as  in  Theorem  2  [4,  p.  88,  Th.  3).  This  equation,  together  with  the  maximizing  equations,  will 
be  used  to  obtain  the  density  function  of  ( v(Pt),r )  from  that  of  L  when  s""  9  span 
\su'\j  €  <</>). 


To  obtain  the  density  functions  of  rand  (vlP^.r)  we  shall  first  define  the  functions  ft,  for 
j  €  <d>  and  establish  several  of  their  properties. 

THEOREM  2:  Let  H  A  |r  €  RJ\Br  >  -ft"")  where  B  is  the  n  x  d  matrix  whose  ./th 
column  is  ft1'1  for./  €  <d>.  For  every  /  €  <d>  define  ft,:  H  —  R[  such  that,  for  r  £  H. 

h,(r)  A  £  ft/' 1  log  8,(r)  -  £  1  log  \„<r) 

“  i-i  «-t 

r  J 

(where  £  log  \*(r)  A  0  if  p  =  0)  In  the  above,  for  every  r  £  R‘{  8 ,(r)  A  ft,""  +  £ 

K-l  _  _  l-l 

r, ft, 1,1  for  i  €  <  n  >  and  A„(r)  A  £  8  ,(r)  for  k  €  <p>.  Also,  for  every  j  £  <d>  and 

=  "  J. 

k  €  <p>  ,  ft/ ' 1  A  £  ft/'1. 

*  !•  Z, 

For  every  /  €  <</>  define  ft,:  (0.°°)  x  //  —  such  that,  for  ( r, r )  £  (0.°°)  x  //. 

ft,(z,r)  A  ft,(r)  if  7  £  <</>  and  ftu(r.r)  A  log  z  +  ft„(r) 

Finally,  define  ft:  // —  and  ft:  <0.°o)  x  //  —  W' ‘ 1  such  that,  for  every 
(;.r)  €  (0.°o)  x  //,  ft(r)  A  (ft|(r) . ft,,(r)T  and  ft(z.r)  A  ( /),,(;. r  )  ,ft ,  (z.r  I . ft.,(r,r))' 

Then 
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(a)  //anti  n ... .  .untinuously  differentiable  in  H  and  (0,°°)  x  //respectively; 

(b)  ft  is  1-1  in  H  and  ft  is  1-1  in  (0,°°)  x  //; 

(c)  ft  is  onto  R‘‘\ 

(d)  If  s'0'  S’  span  {\(,,|/  €  <d>\  then  ft  is  onto  R'y+1. 

PROOF^  (a)  Clearly,  for  every  j  €<</>,  ft,  is  continuously  differentiable  in  H  and,  for 
every  j  €  <d> ,  fty  is  continuously  differentiable  in  (0, °°)  x  H.  Thus,  A  is  continuously 
differentiable  in  H  and  A  is  continuously  differentiable  in  (0,  °°)  x  H. 

(b)  Let  r  and  s  be  elements  of  H  such  that  hir)  =  A(s).  Note  8 (r)  >  0  and  8 (s)  >  0. 
Also,  since  (log  A'|(c) . log  A, /(c))'  is  a  nondegenerate  (/-dimensional  normal  random  vec¬ 

tor,  c i  takes  on  a  value,  say  ct.  for  which  log  A,(r)  —  h,ir)  =  h,(s)  for  every  j  6  <d>. 
Thus,  by  definition  of  A,  and  ft,  for  j  €  <d> ,  8(r)  and  8(s)  satisfy  the  maximizing  equations 
for  D  .  Also,  8(r)  and  bis)  are  feasible  points  of  D  .  Thus,  by  [4,  p.  88,  Th.  31,  8 ir)  and 
bis)  are  optimal  points  of  D  .  However,  by  Assumption  (4),  £>-  has  only  one  optimal  point 
and  hence  bir)  =  Sis).  This  implies  r  =  s  since  the  nullity  set  [b[''\j  6  <d>)  is  linearly 
independent. 


Next,  let  (z(,r)  and  (z2.-s)  be  elements  of  (0,°°)  x  //such  that  hiz\.r)  —  hiz2.s).  Then, 
hir)  =  his),  and  hence,  r  =  s.  Also,  hniz\.r)  =  h0iz2,s).  Hence,  by  the  definition  of  An. 

log  z,  =  hn(Z|,r)  -  £  ft,101  log  b,ir)  +  £  A“u  log  A„lr) 

/-i  «-i 

=  ft„(z,,s)  -  £  b,""  log  8,(s)  +  £  X.11"  log  X,  (s) 

(-1  x  -  I 

=  log  Z  2 

and  thus  Z\  =  zj. 

(c)  Let  w  €  A7.  Since  (log  A|(<) . log  A</(t  ))'  is  a  nondegenerate  (/-dimensional 

normal  random  vector,  q  takes  on  a  value,  say  ch  for  which  log  A,(r)  —  u,  for  every 
j  €  <d>.  By  Assumption  (4),  D  has  an  optimal  point  8  such  that  8  >  0.  Let  r  be  the 

1  ,i 

unique  element  of  RJ  for  which  8  =  ft<("  +  £  rtb{' '  and  denote  8  by  8(r). 

i-i 


Since  ft ( r )  >  0  one  has  r  €  H  Furthermore,  since  8(r)  is  an  optimal  point  of  /)  ,  by  [4. 
p.  88,  Th.  3)  bir)  satisfies  the  maximizing  equations  for  Z)  .  Thus,  for  every  j  €  <d>. 
h,ir )  =  log  K,(c)  —  ur  Hence,  h  is  onto  R'1. 

(d)  Assume  ?  span  { ,v ' ' 1 1  >  €  <d>].  Let  //  —  («| . u,,)’  6  RJ  and  u„  £  Rl  By 

Theorem  I,  (A',|(c).  A,(<  ) . A, /(<))'  is  a  nondegenerate  id  +  I  (-dimensional  normal  ran¬ 

dom  vector  Thus,  takes  on  a  value,  say  c/,  for  which  log  K,ic)  -  u,  for  every  /  f  <d> 

By  Assumption  (4),  D  has  an  optimal  point  8  such  that  8  >  0.  Let  r  be  the  unique  element  of 

./ 

R'1  for  which  ft  -  ft""  +  ^  r,ft(' '  and  denote  8  by  fi(r)  Let  A  v(D  ) 
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Since  B(r)  >0  and  v(£L)  >  0  one  has  ( rQ,r )  6  (0,«)  x  H.  Also,  since  8(c)  is  an 
optimal  point  of  D c,  by  [4,  p.  88,  Th.  3]  S(r)  satisfies  the  maximizing  equations  for  D(.  Thus, 
for  every  j  €  <d> , 

(1)  hj(r0,r)  -  hj(r)  =  log  K;(c)  -  w(. 

Also,  by  [4,  p.  88,  Th.  31,  since  8(r)  satisfies  the  maximizing  equations  for  Dt  one  has 

n  _.<m  p  .  (oi 

r o  -  v(Dr)  -  A0(?)  n  8,(r)  '  n  '  • 

/-I  K— 1 

Thus,  log  r0  =  log  K0(c)  -  £  V0'  loS  Me)  +  £  lo8  (A„ ( r ) ) ,  i.e. 

i-i  «-i 


(2)  ho(r0,r)  —  log  A'o(c-)  *  t/n. 

By  (1)  and  (2)  it  is  onto  Ra+I. 

Note  by  Theorem  2,  for  every  /  €  R‘‘  there  exists  a  unique  point  r,  E  //  such  that 
/  “  h(r/).  Thus,  we  can  define  h  Rd  —  H  by  h~'(l)  A  r,  for  I  €  RJ.  Also,  if  \101  S'  span 
{sl/‘|  j  €  <d>l  then  by  Theorem  2,  for  every  /E  there  exists  a  unique  point 

( Zj.r-/ )  6  (0,«j)  x  //such  that  /  =  Mz, Thus,  when  s101  ?  span  (s''  ’I./  f  <d>j.  we  can 
define  h  /?<#+1  -  (O.oo)  x  //by  /?  '(/>  A  (z^r,)  for  7  6  RJ". 

PROPOSITION  4:  (a)  r  -  h  '</.); 

(b)  (v(P,.).r) -*"'(£)  if  s“"  «  span  ls',,|y  6  <d>). 

PROOF:  (a)  Let  c,  take  on  the  value  <7.  Then  L  takes  on  the  value  /  A  (log  AT  j  (D . 

log  Kd(c)Y.  By  Assumption  (4)  and  [4,  p.  88,  Th.  3 J  one  has  r  €  H  and  log  K,(c)  —  /t,(r  ) 
for  every  j  €  <d> .  Thus,  h~Hl)  —  r ...  Hence,  h  '(L)  takes  on  the  value  r  when  q  takes 
on  the  value  C/,  i.e.,  h~'(L)  -  r. 

(b)  Assume  s<0)  ?  span  {s1' ’I/  E  <</>}.  Then  ft  Rd+]  —  (0.  °°)  x  //  is  well-defined 

Let  c/  take  on  the  value  c,.  Then  L  takes  on  the  value  1  A  (log  A0(c),  log  A^c) . log 

Kd(c)Y.  Note  \(P,)  takes  on  the  value  v ( P  )  >  0.  Also,  by  Assumption  (4)  and  14,  p.  88. 
Th.  3l  one  has  r-  €  H  and  log  K,(c )  -  h,  (v(P  ),  r  )  for  every  j  €  <d>  Thus,  h'[(l)  - 
(v(P-),  r f).  Hence,  h  ' ( Z )  takes  on  the  value  (v(P  ),  r  )  when  17  takes  on  the  value  ch  i.e., 
h  ’(£)  -  (v(/”r).r). 

We  can  now  obtain  the  density  function  of  r. 


THEOREM  3:  Let  </<  denote  the  density  function  of  rand  j?  denote  the  density  function 
of  L  Then 


i/»(r) 


0  i  r  r  S'  //, 

!gO)(r)>|(|dct  T>/r  ( c )  I )  if  r  E  //. 


where  Dh(r)  denotes  the  derivative  of  h  at  r. 
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I’KOOI  t  el  1 7  lake  on  the  value  c,.  Then  stakes  on  the  value  r  A  Ir, . r, ) '  where 

i  ' 

fi  =  />""  +  £  r, />* ' 1  is  the  unique  optimal  point  of  D  By  Assumption  (4),  fi  >  0.  Thus, 

i 

Hr  >  -/>"".  i.e..  /  £  //  Hence,  /-can  only  take  on  values  in  //.  Thus.  <liir)  =  0  if  r  9  II 

l  et  H  he  an  open  Borei  subset  of  //.  Note,  by  Proposition  4, 

Brir  €  H)  =  Prill  '(/.>  £  H)  =  PriL  €  hiB)). 

By  Theorem  2,  h  is  1-1  and  continuously  differentiable  in  B  Also,  g  is  integrable  on  hiB) 
since  g  is  the  density  function  of  /..  Thus, 

PrU.  £  h(B))  =  f  g  =  f  (jf  °//)|det  Dh\ 

J  h  l  H  I  J  H 

[12,  fhs  3-1.'  and  3 - 1 4 1 .  where  g  °h  denotes  the  composition  of  g  and  h  Hence.  Prir  (  IV  = 

f  ( g  /i  t'det  Dli',.  This  implies  >/»(/•)  =  (e<  /;(/))[( Idet  Dliir)  |)  for  r  t  II 
J  ii 

Nest  we  obtain  the  density  function  ol  (v(/*  l.rl 

TIILORLM  4:  Let  i/<  denote  the  density  function  of  i\iP).r)  and  £  denote  the  density 
function  of  L.  Assume  .v'01  <7  span  {j,,l|./  £  <</>[  Then 

0  if  ( ;,r )  d  (0.  oo  )  x  //, 

(.£>(//  (r.r  >)}  ( Itlct  /2//(r.r)|)  if  <-’./•)  £  <<>.«>>  x  //. 
where  Phiz.r)  denotes  the  derivative  of  /tat  iz.r) 

I’ROOI  By  the  proof  of  Theorem  )  it  follows  that  let/'  ).r)  can  only  take  on  values  in 
II), io)  x  //  Hence,  t'hiz.r)  =  0  if  iz.r)  d  (O.oo)  x  II 

Let  B  be  an  open  Borei  subset  of  (0.°°)  x  //and  deline  e  A  v(/’  ).  Note  by  Proposition 
4,  Priiz.r)  £  B)  =>  Prill  '(£)  €  B)  -  Bril.  €  hiB))  By  Theorem  2.  h  is  1-1  and  continu¬ 
ously  differentiable  in  B.  Also,  is  integrable  on  hiB)  since  g  is  the  density  function  of  / 

Thus.  Pril.  €  hiB))  =  f  g  =  f  ig  °//)|det  Ph  \  [12,  Ths  3-13  and  3 - 1 4 1 ,  where  g  "h 
J  Mfii  J  ft.  r 

denotes  the  composition  of  g  and  h.  Hence,  Priiz.r)  £  B)  «*  ig  °h)  Idet  /Wr|.  I  his 
implies  ibiz.r)  =  I  giliiz.r)))i  Idet  Dhiz.r)  |)  for  iz.r  )  €  (0.  °° )  x  //. 

When  v11"  d  span  { s  ‘ '  ’  I  /  £  <</>!  the  above  theorem  immediately  yields  the  density 
function  of  v(  B  ). 

COROt  I.ARV  4  I  Let  /  denote  the  density  function  of  \  I  B  I  and  assume  v  d  span 
I  s'  1 1  /  f  <  J  >  |  Then 

0  if  r  d  (0.  'X) ) , 

f  |,i'(//(r,rtl|(|det  Dhiz.r)\)Jr  ,f  r  (  <()- 00  1 

J  r  ■  If 


>!iiz.r)  = 


Observe  that  to  evaluate  /  at  r  £  (().«>)  by  Corollary  4  1  one  must  integrate  a  specified 
lunction  over  the  convex  polyhedral  set  //  =  (r  €  R‘\Br  >  6  "I  When  the  degree  ol 

difficulty  (/equals  I  then  II  will  be  an  interval  in  B  1  whose  end  points  can  easily  be  obtained 
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Thus,  when  d  »  I.  one  can  accurately  approximate  J(z)  by  applying  a  quadrature  formula  lo 
evaluate  the  integral  expression  for  f(z).  However,  the  quadrature  rule  must  be  modified  as  in 
(7,  Ch.  7.  Sec.  6. 2 1  to  allow  for  the  fact  that  the  integrand  is  not  defined  at  the  end  points  of 
the  interval  of  integration. 

When  d  >  l  the  effort  and  expense  of  devising  and  applying  a  quadrature  scheme  to 
approximate  the  integral  expression  for  /(z)  to  a  high  degree  of  accuracy  may  not  be  justified 
since  frequently  the  distributions  chosen  for  the  stochastic  c,  will  be  subjectively  determined. 
In  such  cases  a  numerical  Monte  Carlo  method  could  be  an  attractive  alternative  for  approxi¬ 
mating  the  multiple  integral  used  to  express /(r)  (6,  14,  15]. 

Finally,  under  the  assumption  of  Corollary  4.1  note  the  distribution  function  of  v(F,), 
denoted  by  F,  is  given  by 

0  if.v  <  0, 

FO)  =  c  , 

J  \g{h(z.r))  (  det  Dh(z.r) \)drdz  f  _ 

dl;.r l(  n  if  v  >  0. 

Thus,  if  great  precision  is  not  required  a  numerical  Monte  Carlo  technique  could  be  attractive 
for  approximating  F(y)  as  well  as  /(z). 

5.  THE  MOMENTS  OF  \(PC ) 

In  the  following, Tor  each  random  variable  Q ,  recall  EU  '(Q )  denotes  the  moment  of  order 
i •  of  Q  whenever  it  exists,  where  i>  €  N  (the  set  of  positive  integers).  Also,  let 
EIQ)  A  E("{Q) 

Throughout  Section  5  we  assume  EU'(\(,P,))  exists  for  every  v  €  N.  Proposition  B  in 
Appendix  B  establishes  that  boundedness  of  the  dual  feasible  set  F  A  (6  6  K"|^'8-0, 
q' 8  =  1,8,  ^  0  v  i  €  <  n  >  |  is  a  sufficient  condition  for  the  above  moments  to  exist.  Furth¬ 
ermore,  one  can  show  P  is  superconsistent  for  every  c  €  /?>  iff  Fis  bounded  (see  p.  554). 

To  calculate  the  moments  of  v(F, )  it  is  advantageous  to  use  the  density  function  of  r 
instead  of  that  for  v(F, ).  To  obtain  the  moments  of  v(F, )  in  terms  of  the  density  function  of  r 

we  shall  use  the  function  w.  H  —  R'  defined  by  <u(r)  A  for  r  €  H  Thus,  ca(r)  -  fj 

i  ~  I 

»  -/,«»  P  K  <o» 

if  p  =  0  and  a >(r)  =  J"J  8,(r)  XK(r)  *  if  p  >  0,  for  r  €  //. 

I  -  l  K-l 

PROPOSITION  5:  v(F,  )  =  c'"a>(r). 

PROOF:  Let  <7  take  on  the  value  17.  Then  c'  "w(r)  takes  on  the  value  K„(c)<u(ri )  where 

.1 

r  =  (/•).  ....  rjY  is  the  point  in  //  for  which  8  =  h""  +  £  r,bu  1  is  the  unique  optimal  point 

i-i 

of/),.  Since  8  >  o  one  has  A'n(o)<a(r  )  =  v(P  )  |4.  p  88.  Th.  3],  Thus,  v(  P, )  =  r>(  "o>(  r ) 

We  shall  now  obtain  the  moments  of  v<  P  )  when  s""  £  span  !  '  'I  /  £  <d>) 

THEOREM  5:  Assume  i""  £  span  |sl,,|./'  €  <d>)  Then,  for  every  r  £ 

F1’ '(v(P, ))  -  J"  ^  <  r )]’«/» ( r)  dr  where,  for  r  €  //. 


I 
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e°  flM')"' 


if  p  —  0, 


en  Yl  Kjr)  ^  if  p  >  0. 

l-t  K-l 

n  it  li 

In  the  above  D4  £  b,'01  —  £.v,6,(,‘  log  a,,  u,  A  £  1  —  6,(0>  for  /  6  <n>,  and 

“  /-i  ,-i  ~  ,-i 

,/ 

v„  A  £  1  - A""  for  k  €  </>>  where  (vi . v(/)'  is  Ihe  unique  elemeni  of  R'1  for  which 

-!/ 

s10’  -  £  V'1- 


PROOF:  We  shall  assume  p  >  0.  (The  modification  needed  in  the  proof  for  p  =  0 
should  be  clear.)  By  Assumption  (2)  and  Proposition  1  the  set  {s,'l|y  €  <</>!  is  linearly 
independent.  Hence,  by  Proposition  3  there  exists  a  unique  element  (v, . vj)'  of  R‘‘  for 

i  /  i / 

which  s"”  =  £  Also,  by  Proposition  3,  log  /f,,(c)  =  £  y,  log  A’,(c )  +  Z)  where 

,-i  ^  ,-i 

Z)  A  ]T  6,""  -  £  v,6('' ']  logo,.  By  Proposition  5  v(P,)  =  A0(r)to(r).  Thus, 

(1)  log  (v(P, ))  =  log  A0(c)  +  log  (<a(r)) 

=  £  y,  log  *(<c>  +  Z)  +  £2  A""  log  \K(r)  -  £  6/"’  log  8 ,(/•). 

/-I  «-l  ;-l 

Let  c/  take  on  the  value  c,.  Then  r  takes  on  the  value  r  =  (r,(c) . r(/(c))'.  Since 

8t  A  6""  +  £2  (r,(c))ft1' 1  is  an  optimal  point  of  Dt  and  8,  >  0  (by  Assumption  (4))  one  has 

/-i 

(by  [4,  p.  88,  Th.  31) 

log  A',(c)  =  Zi,(r  )  =  £  6,' ' 1  log  8 ,(r  )  —  £  M 1  log  Mr,  ) 

/-i  «-i 

for  every  /  €  <ii>.  Thus,  by  (I), 

log  <v(P  ))  =  £  J£  A,1'1  log  8,(r  )  -  £  ' 1  log  Mr  ) 

I-' 

+  Z2  +  £  C"  log  \.(r  )  -  £  6,""  log  8,(r  ) 


r>  +  I  I  vA1"'  8,  log  8 ,  ( r  ) 


Hence, 


£  £  .v,A,l' 1  -  A.""  log  A„(r  ). 

»-i  /-i 


v(P, )  *  c"  n8  (r)  n  A«(r)  *  <L(r). 

-t  «-t 

It  follows  that  t:“  ’  ( v  ( /',  ) )  -  Z;((v(P,  )|’)  -  f  (<o(r)|'i/i(r)</r  |8.  p.  18,  Th  1.4.31 

J  t*  // 
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We  next  obtain  the  moments  of  v(/*t)  in  terms  of  the  density  function  of  r  when  s'1"  S' 
span  { i ‘ ' 1 1  /  €  <</>). 

THEOREM  6:  Assume  s""  G  span  Is1'1!./  €  <d>)  and  let  v  €  N.  Then, 

(a)  e"  is  lognormal  and  independent  of  R  A  iu(r): 

(b)  ))  =  E'r'{e‘")Eu  1  (R  ); 


u 

n 

■» 

J' 

..  <0»  ^  L  (01 

+  y  (Si{))  \s"") 

1 

.•  - 1 

where  su"  = 

(d)  £"  1  (K>  =  J  {«u(r))'iMr)<//-. 

PROOF:  (a)  By  Theorem  1  is  normal  and  hence  c1 "  is  lognormal. 

Note  by  Proposition  4  iu(r)  =  mUi  l(L  1)  where  L  =  . L<tY .  By  Theorem  1  L„ 

and  /.are  independent.  Thus,  <•  "and  R  are  independent  [8,  p.  15,  (Ill)). 

(b)  To  show  that  E(R‘ )  exists  let  ,V  A  e  1,1  and  >  A  R‘ .  Clearly  (.V,  >  )  is  a  continuous 
random  vector.  Thus,  let  w  be  the  joint  density  function  of  (A.  E).  Also,  let  w,  and  w,  denote 
the  marginal  densities  of  .V  and  >  respectively.  Then  =  M|(.v) hj(>')  for  all 

( v. i  )  €  Rl  x  R 1  since  A' and  >  are  independent  by  (a). 


'  V 


Proposition  5  [v(P  > }‘ 1  =  XV.  Thus,  by  assumption,  E(XY)  exists  and  hence, 
xyw(x.y)d\dy  is  convergent.  Thus,  by  Eubini's  Theorem  1 1 0,  p.  207,  Th  2.8.7] 

vnvt.v, v)tlxdv  =  f  //()•)</»•  where  // (  v  )  A  f  ath  (,x\ v)(/v  =  wcdcl  f 
Jo  =  Jll  ■  Jo 


■ill-  K  1  >  K  1 

){/.v  =  in,(i •)/.'( A’)  This  implies  E(XY) 
is  convergent,  i.e.,  l'AR‘ )  exists. 


/.(A)  f 

J  II 


nvT  i  )<A  and  hence 


VH'dy  )dy 


Since  the  expected  values  of  o'*  "and  R‘  exist,  the  independence  of  o' 1 "  and  R‘  implies 
E  (<■' ' ")/:  (R‘ )  =  lAc‘l'R ')  [5,  p  82,  Th.  3.6.2)  Thus,  by  Proposition  5, 
£ 1 ' 1  ( v  ( /*  ) )  —  /•.(  |, •'"//)■)  =  /:(<■'  '")£<  R‘  )  =  Eu  1  (<•'  AR). 


(c)  Recall  by  Theorem  1  EM. „)  =  £  m,*,""  -  L  ;,ni1  »’</.„>  -  ,v""  As""  where 

•  i  ,  - 1 

v""  -  fi'b""  and  1  (/.,,)  denotes  the  variance  of  By  |2|  one  has  "(c  ")  -  exp  |r £'(/.„)  + 
|  i'  ’  1  (  /,n)l  since  /.„  is  normal. 

(d)  l  sing  the  density  function  i//  of  r  we  obtain  /•.'(//)  =*  EAR')  =  J  (<u  ( r  ))‘  i/y  (  r  )Jr 
|8.  p  18.  Th  14  31 

Note  that  to  evaluate  '<v(l*  ))  by  Theorem  5  or  6  one  must  integrate  a  specified  (unc¬ 
tion  over  the  convex  polyhedral  set  H  =  \r  i  R'\Rr  >  />"")  Hence,  the  comments  made 
in  Section  4  concerning  the  evaluation  of  /<r)  also  apply  to  the  evaluation  of  "(v  I /’  I)  In 
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particular,  note  that  for  a  given  precision  the  amount  of  work  required  to  calculate  £'' 1  ( v ( P  )) 
by  Theorem  5  or  6  should  be  about  the  same  as  the  amount  required  to  calculate  /  ( r )  by 
Corollary  4.1.  Thus,  in  calculating  £'‘'(v(P)),  it  is  advantageous  to  express  £"  '(v(  P  1 1  in 
terms  of  the  density  function  of  r  as  in  Theorems  5  and  6  rather  than  to  express  £''  (v(P  I)  as 


6.  EXTENSIONS 

In  this  section  we  shall  indicate  how  the  preceding  results  can  be  used  to  obtain  the  distri¬ 
bution  and/or  moments  of  v(P()  when  P,.  need  not  satisfy  all  the  assumptions  of  Section  5 
However,  no  formal  statements  or  proofs  will  be  presented. 

In  the  following,  we  shall  refer  to  strengthened  versions  of  Assumptions  (2),  (3).  ana  * 4 ) 
which  are  stated  below  for  a  stochastic  geometric  program  P  that  satisfies  Assumption  < 1 » 

We  say  P  satisfies  Assumption  (2‘)  iff  P  satisfies  Assumption  (2)  and  .5""  <2  span 
£  <J> |  where  s' ' 1  A  fi'b1 ' 1  for  every  j  €  <d>.  Here  { £ 1 ' 1 !./  €  <•/>)  is  any  nullity 
set  for  P,  and  bu"  is  any  normality  vector  for  P  Also,  the  matrix  (1  is  defined  as  in  Assump¬ 
tion  ( 1 ). 

P  is  said  to  satisfy  Assumption  (.V)  iff  P  is  superconsistent  and  soluble  for  every 
F  €  R 

Finally.  P  is  said  to  satisfy  Assumption  (4'l  if!'  D  has  a  unique  optimal  point  8  and 
8  >  0  for  every  c  €  R ", . 

Now  consider  a  family  of  random  cost  vectors  |c(t)|*  €  where  <  <«  I  A 

(i|(t) . c„(e))'for«  €  (O.oo),  that  satisfies  the  following: 

(i)  (log  c  |  (e ) . log  c„U))'  is  a  nondegenerate  normal  random  vector. 

(ii)  /-.'(log  =  £(log  f, )  for  every  i  €  <n> ; 

(iii)  lim  C'ovdog  c,  (*),  log  <’,  (« ) )  =  Covdog  log  c  )  for  everv  (/,/)  (  <  n  >  *  <«>. 
•  l» 

where  Cov  denotes  covariance. 

Such  a  family  of  cost  vectors  can  easily  be  constructed  it  P  satisfies  Assumption  (II 
When  P  also  satisfies  Assumptions  (.!')  and  (4')  one  can  show  that  /', ,,,  will  satisfy  Assump¬ 
tions  (1),  (2'),  (.!').  and  (4’),  where  P, is  obtained  from  P  by  replacing  r  with  the  cost  sec¬ 
tor  c  («).  Thus,  the  results  of  the  preceding  sections  can  be  used  to  calculate  the  monte  its.  dis¬ 
tribution  function,  and  density  function  of  P, ,, ,  for  t  €  (O.oo)  Additionally,  one  can  establish 
that  the  moments,  distribution  function,  and  density  function  of  /’,,,  converge  to  the 
corresponding  moments,  distribution  function,  and  density  function  of  P,  as  t  tends  to  zero 

Next,  consider  the  family  of  stochastic  geometric  programs  { P ‘ v  1 1 -y  f  where,  lor 

y  £  (O.oo),  P'vl  denotes  the  following  stochastic  program 

mi  i  <  ft '  n  v 
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subject  to  i  n  i“"  +  zK  <  1  for  every  «€</>>,  1  >  0,  and  z  >  0  where 

t*j,  /-i 

/  =  (/) . and  z  =  (z, . z(,)'.  One  can  show  />,lyl  satisfies  each  assumption  that  P, 

satisfies.  In  addition,  if  P,  satisfies  (3')  then  P'Y>  satisfies  (3')  and  (4'j  (even  when  Pt  does  not 
satisfy  (4)).  Thus,  one  can  apply  the  results  of  the  preceding  sections  to  calculate  the  density 
function,  distribution  function,  and  moments  of  P'y‘  for  y  6  (0,°°)  when  P{  satisfies  (1),  (2'), 
and  (3').  Furthermore,  one  can  establish  that  the  moments,  distribution  function,  and  density 
function  of  P'y>  converge  to  the  corresponding  moments,  distribution  function,  and  density 
function  of  P \  as  y  approaches  zero. 

Finally,  for  y  €  (0,°°)  and  «  (0,°°),  let  P^t\  denote  the  stochastic  program  obtained 
from  P'y)  by  replacing  cost  vector  c  by  c(e)  in  The  family  of  cost  vectors 

|c(e)|  €  €  (O.oo))  is  assumed  to  satisfy  the  properties  (i),  (ii),  and  (iii)  previously  listed.  One 
can  show,  for  (>,«)  €  (0,°°)  x  (0,°°),  program  Pt(e)iy)  satisfies  (1),  (2'),  (3'),  and  (4')  if  Pt 
satisfies  (1)  and  (3').  Thus,  in  this  instance,  one  can  apply  the  results  of  the  preceding  sections 
to  P.V.V  This  suggests  that  the  family  of  programs  l/V^’  \y  €  (0,°°)  and  €  €  (0.°°)|  may  be 
useful  in  obtaining  the  moments,  distribution  function,  and  density  function  of  P,  when  P, 
need  only  satisfy  Assumptions  (1)  and  (3'). 

APPENDIX  A. 


Theorem  1  in  Section  3  is  an  immediate  consequence  of  the  following  lemma. 

U 

LEMMA  A:  Define  H;,s)  A  £  s,  log  z,  for  every  positive-valued  random  vector 

—  r-  I 

z  A  <Z| . z„)'  and  s  €  R“.  Also,  define  the  inner  product  <-,->s  on  R“  by 

<x,y>  \  A  a'Av  for  ix.y)  €  R"  x  R‘‘.  (Note  <-,->\  is  an  inner  product  since  A  is  a  disper¬ 
sion  matrix  of  a  nondegenerate  normal  random  vector  and  hence  is  positive  definite.)  Then 


(i)  (Z-lt’.s111) . He,s,J'))'  is  a  normal  random  vector  with  independent  components 

where  e  A  iet . s'"  =  s'",  and 


/  i 


£  i<su\s"'>  ,)  '  (<sl".sln>  x)s 


i- 1 


for  1  <  /  ^  d. 


(ii)  ( L  ((’,.v""), /.(<’, s'") . Lie.s1'"))'  is  a  normal  random  vector  with  independent 

components  if  s“"  ^  span  |  v ( ' 1 ! ./  6  <</>!  where  .v""  =  5""—  £  (<  .s ' ' 1 .  .s 1 ' 1  >  x)  1 

(  <  s"”..s,,l>  \)s"'  when  s"’’  ?  span  |.v‘'  ’ I /  €  <  d>  ). 


(iii)  For  every  j  €  <  d> ,  s1' '  =  13'b1' '  and  [j  c,  = 


/  i 


-t 


n« 


ft" 


{exp  {Lie. s'1  ’)))  where 


b"]  «  b"'  and,  for  1  <  j  ^  d.  b"  ’  =  ft1'  ’  -  £  <  < /3 . /3 'fr ' '  ’  >  0  1  (  <fi'b' ' '  ,(3’h'"  >  s>  b 

.ill  a'li/l 


Also  6"”  =  bm  if  s""  €  span  {s"'U  €  <d>)  and  b'"'  =  ft'0' 


£  i<P'b'l,.fi,b‘n>  s) 

I- I 


(</37>"”./37>m> \)b,n  if  s"”  9  span  Is*'  '\j  €  <d>\.  Furthermore,  (61' ']./  €  <d>]  is  a  nul¬ 
lity  set  and  is  a  normality  vector  of  P, ; 
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(iv)  For  j  €  <(/>,  the  density  function  <f>,  of  L  ie,s{' ')  is  given  by  <£,(/)  A 


U~P,)2 


.\f2rr 


exp 


for  every  I  6  Rl  where  p,  is  the  expected  value  of  Lie. s'1' )  and  to;  is  the  vari- 


2to  / 

ance  of  L  ie.s'1 ').  Furthermore,  p ,  =  £  p.s'1 1  and  to;  =  <su  ',s('  ’>  v 


PROOF:  Since  A  is  real  symmetric,  A  has  an  orthonormal  set  of  u  eigenvectors 

I Pi . pu).  Let  P  be  the  u  x  u  matrix  whose  yth  column  is  pr  Then  P  is  orthogonal  (i.e., 

P  1  =  P ')  and  A  A  P~ 1  A  P  is  diagonal. 

For  every  /  €  <u>  let  y,  A  log  e,  and  y  A  (y, . y„)'.  Let  y  A  P'y.  Then  y  is  a  u- 

variate  normal  vector  with  dispersion  matrix  P'AP  =  A  ([8,  Th.  2.1.1] )—  Since  A  is  diagonal, 
the  components  of  y  are  independent 


Let  (s.w)  6  R“  x  Ru  such  that  a  =  P's.  We  shall  show  Lie.s)  =  Lie, a)  where  e  A 

( e i . and  e  A  e  for  i  (.  <u>  Note  y  =  Py.  Thus,  for  ;  €  <u>,  y,  =  £  p,K y„. 

«  -  ! 

Hence. 


(1) 


/.  ( < ,  s  i  =  £ 


Ip.’ 


-  I 


If.' 


y  » ,  > ,  *  / 1 1*.  h  t 


For  every  /  *  *  ,/  >  d.nnc  a  A  P  s  ’  By  assumption  {s’ '  ’ |>  €  <d>  |  is  linearly 
independent  Thus  |a  i  •  <  J  >  I  is  linearly  independent  since  P'  is  nonsingular.  Thus,  one 
can  apply  the  (iram-Schmidt  orthngnnali/ation  process  to  |h-,,i|./  e  <d> )  to  obtain  the 
orthogonal  set  j  a  /  ►  <■  d  » I  with  respect  to  <  .  >  N  where  a1"  A  a  "’  and,  for  1  <  j  <  d. 


(21  a  I  A  a  ^  <  a  1 .  a  '  >  ^  I  1  (  <  a 1 1  ' ,  w 1 1 '  >  ^ )  w 1 1 1 . 

.  •  I 

(Note  <  ■,  >s  in  the  inner  product  on  R‘‘  defined  by  <.v,.r  >  ^  A  .v'A  c  for  (x,y)  6  R"  x  P" 
<  •,  ■>  x  is  an  inner  product  on  R"  since  A  is  the  dispersion  matrix  of  the  nondegenerate  nor¬ 
mal  random  vector  y  and  hence  is  positive  definite  )  Also  define 


(3) 


A  a 


j 

I 

/-i 


(  <  a  ‘ 


’>*)  1  1  <  a101, w<(l >  a-* 


if  s’""  ?  span  |sl,l|/  €  <c/>|;  otherwise  define  a101  A  a"".  Observe  if  s""  1  span 
{ i 1 ' 1 1./  6  <t/>|  then  a""  ?  span  | ' 1 1./  €  <d>).  Thus,~| w1'  '| /  6  <d> }  is  an  orthogonal 
set  in  P"  with  respect  to  <  •,  •>  v  when  s101  ?  span  I?1'  ’  1/  €  <d>  |. 


Define  s,MA  Pa1'1  for  every  j  €  <d>.  Then,  for  /  €  <d> ,  a,(|l=  P's'1'.  Thus,  by 

(1), 

(4)  L  (e.s1'  ’)  =  L  ie.w1'  ’)  for  every  j  €  <d>. 

We  shall  next  show  (Lie, a,MI) . Lie, is  a  normal  random  vector  with  independent 

components.  Also,  whenever  sini  span  { s 1 ' 1  |y  €  <d> ),  we  shall  show  that 
( L  (e,Hw)  ,L  (e,  aMI) . L  (e,aIJ'))'  is  a  normal  random  vector  with  independent  com¬ 

ponents. 
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For  every  /  €  <u>  lei  and  r,  be  the  variance  and  mean,  respectively,  of  log  e,.  For 

—  u 

i  €  <u>  define  il/,  A  tf,_!  (log  e,  —  t,).  Note,  for  every  j  €  <d>„  L  (e, k(/ log 

/-i 

u  11  — 

c,  =  £  », *v,‘ ' 1  <1/,  +  £  t.w/'1.  Let  r,r  €  <d>  such  that  r  ^  r.  Recall  y  4  (log 

<-i  ,-i 

c, . log  eu)'  is  a  normal  vector  with  independent  components.  Thus,  €  <u>j  is  a 

set  of  independent  unit  normal  random  variables.  Thus,  Lie,wlr))  and  Lie, wU))  are  norma! 

14 

random  variables.  Moreover,  Lie,  w  ‘ )  and  Lie.w'")  are  independent  provided  £ 

,-i 

0;wl,,,w,t"  =  0  [8,  Th.  4.1.1,  p.  70], 

Since,  for  every  i  €  <u>,  0}  is  the  variance  of  y,  and  A  is  the  dispersion  matrix  of  y 

14  _ 

one  has  £  tf/V'V'1  =  <  wl'1,wm>  By  construction  of  (tv'"| j  €  <d>\  one  has 

i  =  i 

<  ^  =  0  for  r,t  €  <  d  >  with  r  ^  t.  Also,  <  ^  =  0  for  r,i  6  <  d  >  with 

r  ^  i  provided  s"1*  €  span  (s'' '  |y  €  <d> ).  Hence,  by  (4),  ( Lie,s{U ) . Lie,sun)Y  is  a 

normal  random  vector  with  independent  components.  Also  by  (4),  (L(e,slUI), 

Lie.s{X)) . L(f,s,J,)r  is  a  normal  random  vector  with  independent  components  if  i101  ? 

span  {5°  '\j  €  <d>  ). 

Next,  let  (x{,'y')  €  R"  x  R"  for  /  €  ( 1 , 2)  such  that  yx'  1  =  P’xu '.  Then, 

(5)  <v'V?’>  *  =  <  P'xix,,P'xi2i>  ^  =  (xn,)'PAP'x,2>  =  <x(l,.x,2l> 


Observe  s'"  =  /V111  =  Pw"'  =  s'".  Also,  by  (2)  and  (5),  for  1  <  ./  ^  d  one  has 

(6)  s'"  =  Pw'n  =  Pw(”  -  £  Kwt,\w,''>K)Pwtl' 

i- 1 

=  S'"  -  £  (<s,/,.s,/‘>  ,)  ’(<  S" 

/-i 

Moreover,  by  (3)  and  (5)  one  has 

(7)  ,■">’=  />».'•»' -  .5"”-  £  (<s',,.s'M>y  1  (<s"".s,,’>  ,)s(n 

/-i 

if  s"11  #  span  (s'1  ’|y  €  <<7  > !.  This  completes  the  demonstration  of  (i)  and  (ii). 


Next,  we  shall  obtain  a  nullity  set  |fr'"|y  €  <d>)  and  normality  vector  bll,x  for  P,  such 


that  .v1' 1  =  li'b'1 1  and 


n« 


(exp(  7.  (c,s(' ’)))  for  every  ./  6  <d>.  Let  S  4 


span  { s * ' 1 1 /  €  <d>)  and  Zf  A  span  I./  €  <</>-}.  By  assumption  { s ' ' 1 !./  €  <d> )  is 
linearly  independent  and  hence  a  basis  for  .S'.  Thus,  there  exists  a  unique  linear  transformation 
7  .S  —  B  such  that  T(s‘")  =  bx‘ 1  for  every  /  €  <d> .  Since  |A"  1 1./  6  <  */> )  is  a  basis  for 


B,  T  is  an  isomorphism  from  S  onto  B  Also,  since  T  Ub 
I  €  <d>  one  has  T  '(/))  =  fi'b  frr  every  b  €  B. 


')  - 


=  fi'b  for  every 


Recall  s'"  =  s'1’  (  S  Let  I  <  /  <  d  and  assume  s'M  €  S  for  1  ^  /  <  /.  Then  by  (6) 
one  has  s1'1  €  S.  Hence,  (s'11!/  €<</>)  C  .S'.  Also  (s'' ' ( /  €  <c/>|  is  linearly  independent 
since  {«'  '[/  6  <d>\  is  orthogonal  with  respect  to  and  P  is  a  nonsingular  matrix  for 
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which  s{,)  =  Ptv  ’ '  lor  every  j  €  <d>.  Thus,  (s(,l|y  €  <d> )  is  a  basis  for  A.  Since  T  is  an 
isomorphism  from  A  onto  fl,  {7'(s,,,)|y  €  <d>  \  is  a  basis  for  B.  Thus,  \bu'\j  €  <d>  |  is  a 
nullity  set  for  Pe  where  b[l  *  A  T(sU))  for  j  6  <d>. 

Let  A  A  span  { s * 7 1 1 7  €  <d> }  and  B  A  span  [b{l)\j  6  Suppose  s_<0’  9  S.  Then 

there  exists  a  unique  linear  transformation  T:  A  —  B  such  that  T(s(/>)  =  b1'1  for  every 
j  6  <d>.  Also,  by  (7),  sl0)  6  A.  Thus,  we  can  define  bi0>  A  ffs10’).  Note  by  the  definition 
of  Tone  has  f(s)  =  T(s)  for  every  s  €  A.  Thus,  by  (7), 

(8)  bm)  =  t(sta))  -  £  «s('\su)>0^«stm,su)>0nsu') 

/-i 

=  bi01  -  £  «s,''y/l>,)-l(<.i<0l.s,',>,)6,', 

/-1 

since  T(su))  =  T(s(n)  —  bu)  for  l  €  <d>.  For  j  €  <m>  let  At  denote  column  j  of 
exponent  matrix  A.  Recall  q  6  R"  such  that  q,  =  1  if  /  €  <  n o>  and  </,  =  0  if  /  >  «n  where  rt« 
is  the  number  of  elements  in  7«.  Then  by  (8),  for  every  j  6  <m> ,  one  has  /f(>  =  0 

since  6<01  is  a  normality  vector  and  (6(,l|/  €  <  d>  1  is  a  nullity  set  of  Pt ,  where  <  •,  ■>  denotes 
the  usual  inner  product  on  R".  Also,  <bi0),q>  =  <bi0>,q>  =  1  since  <bil],q>  =0  for 
every  /  €  <d>.  Thus,  6,(l1  is  a  normality  vector  for  Pt.  If  s(01  €  A  we  define  ft101  A  blw. 
Thus,  whether  s<("  9  S  or  s(01  €  A  one  has  that  bm  is  a  normality  vector  for  P(  . 

To  show  p'bij)  =  sU)  for  every  j  €  <d>  first  observe  for  j  €  <d>  one  has 
/37>(/)-  7W|(6(,I)=  T~\(T(su  ’)  =_s<,).  Next  suppose  v<0)  9  A.  Then  f  is  an  isomorphism 
from  A  onto  B  since  [bU)\j  €  <d> )  js  linearly  independent  and  f(sin)  =  bU)  for  every 
j  €  <d>.  Also,  T~'(bU))  —  v0)  =  p'b(l  ’  for  every  j  €  <d>.  Hence,  7^'(6)  -  P'b  for  all 
b  €  B.  Note  6(0>  A  f(s,0))  6  B.  Thus,  P'bi0)  -  7~1«>,0,)  =  !(f(s,m))  =  s(0).  Finally,  sup¬ 

pose  ,v,0)  €  A.  Then  B'b  (0)_=  p’bi0)  -  C(0’  -  />w(0)  -  ftv(0)  -  s,0).  Thus,  from  the  above, 
/3'6(,  )  =  s*'1  for  every  j  €  <rf>. 


Next  let  y  €  <rf>  and 


"  kh  <  «  «  «  |n,'"  "  *'■ 

observe  fl  T  =11  =  11“-' 

"  «»t  ,-i  .-1  I  1-1 

.•  Id, A"')  *  „ 

[Jr'"1  Thus,  since  B'b1''  *»  s1'1,  one  has  c, ' 


£  s'1'  log  J  U  n«‘;"  |exp(L  (e.s1'  ’))). 


Note  =  ns11’)  =  nTw"’)  =  7'(/>h'"’)  =  ni(")  =  Also,  by  (6),  for 

1  <j^d  one  has  b(l>  -  T{s(’ ’)  -  ')  -  £  (<s,,'.s,,’>  x)  1  (<  s"  '.sln>  0 

1  /-I 

T(s{n)  =  ft*'  ’  —  £  (<0'b{'\p‘bu)>  O  '  (< P’b,l\P'b'n>  0b,n.  Recall  if  s,ni  6  span 
(s,/'|y  €  <rf>(  then  bwi  A  6(01.  If  s(nl  9  span  { s ' 7  ’ |./  €  <d>)  then  by  (8)  one  has 
b(0)  -  bm>  -  £  (<p 'bin,B'bU)>  0~'(<P'bw,.P'b(n>  0b,n. 

/-I 

This  completes  the  demonstration  of  (iii). 
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Finally,  lei  j  6  <d>  and  recall  L(e.su')  =  jT  s,{,>  log  er  Thus,  p,  =  jP  and 

/-i  /-i 

oj;  =  <s{l s{j)>  \  [8,  Th.  2.1.1,  p.  29]  since  (ju, . /*„)'  is  the  mean  vector  and  A  is  the 

dispersion  matrix  of  e.  Also,  since  L(e,sui)  is  normal,  one  has  <£,(/)  =  - — \=  exp 


(/  —  P  ,)2 


2o>, 2 


i  l'j2n 


for  every  /€/?'. 


APPENDIX  B. 


PROPOSITION  B:  (a)  If  H  is  bounded  then  £,,  )(v(/>, ))  exists  for  every  £  <V. 

(b)  H  is  bounded  iff  the  set  F  A  (8  £  /?  "|8,  >  0  V  /  €  <n> ,  A  '8  =  0,  and  c/'S  =  1 }  is 
bounded. 


PROOF:  (a)  Assume  //is  bounded.  Then  for  every  j  €  <r/>  there  exist  real  numbers  I, 
and  u,  such  that  /,</■<  u,  for  every  r  €  H.  Let  v  €  N  and  define  A  v(P, ).  Finally, 
assume  c,  takes  on  the  value  F/  and  recall  r  =  ( r , (F ) . rt/(c))'  is  the  element  of  H  for 

,i 

which  8  A  6(m  +  £  (r,(c))bu  1  is  the  unique  optimal  point  of  D  . 

i  - 1 


Since  8  is  the  optimal  point  of  D  ,  by  [4,  Ch.  3,  Sec.  3]  and  Assumption  3  one  has 
ri  A',(h)'  ’  n  S,(r  )  "v',nx.(r)"v'1 


where  {"[  A  I  for  r  €  //,  the  closure  of  //,  if  />  =  0.  Define  r:  H  —  R1  by 

K-l 

r*  ^  |  r  |  /’  X  ( f  I  _ 

r(r)  A  J~|  8,(r)  '  A„(r)  “  for  r  €  //.  In  evaluating  r(r>  use  the  convention 

1  =  1  K=  1  _  _ 

.v v  =  a  '  =  I  for  v  =  0.  Then  r  is  continuous  on  H.  Thus,  since  H  is  compact,  there  exists 
U  €  (0.°°)  such  that  0  <  r(r)  <  U  for  every  r  €  H.  Hence, 

(I)  0  <  H  U‘  K„(c)‘ 

I  ~  I 


I.  Assume  \m>  €  span  { s * ' 1 1 /  €  <*/>).  Then  by  Proposition  3  there  exists  v,  €  R  1  for 

i/ 


/  6  <</>  and  W'  €  (0.«>)  such  that  K{)(c)  =  XV  [J  .  Thus,  by  (1), 

j 

(2)  0  <  r'  ^  (t/Jf')'  fl  K‘{z) 

/  =  i 


i  =  i 
iii  ir  i,  n 


Let  j  €  <</>.  If  0  <  K,(T)  <  1  then  A.', (D* '  ’  s?  A,(e)' '  A,(F)' ' .  Also,  if 

I  <  A',(F>  then  A,(e)"' X',(F)"'I‘ '  A,<c)"  A,(F)' Thus.  0  <  A,(F) . 

^  Z,(c)  A  max  (A,(c)’  '  ’’  .  A,(c)'  “  ).  Hence,  by  (2), 

./ 

(3)  o  <  <  iuwy  fj  z, ( r ) . 

/-i 

Moreover,  by  the  choice  of  (6,'ll./  £  <(/>),  the  variates  A,(c>  for  ./  £  <</>  are  independent 
and  hence  ( Z,  (<  )  I  /  £  <</>)  is  a  set  of  independent  variates. 
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By  definition  ui  Z,(c)  one  has 
(4)  0  <  Z,(c)  f 

Since  A',(c)  is  lognormal  so  are  K  ,{c)' and  A,(c)'  Thus,  the  expecied  values  of 

these  two  variates  exist  and  hence  so  does  E(K,(c)‘  1  +  K,(c)‘  ).  Thus,  by  (4),  the 

expected  value  of  Z,(c)  exists.  Since  the  variates  Z,(i )  for  j  £  <</>  are  independent  the 

</ 

expected  value  of  (UWY  [~[  Z,(c)  must  also  exist  (5,  p.  82,  Th.  3.6.2],  Hence,  by  (3),  the 

/-i 

expecied  value  of  exists. 


II.  Assume  s""  ?  span  { s’ ' 1 1./  €  <r/>).  Then  by  the  choice  of  (A1' 1 1 /  €  <d> )  the  vari¬ 
ates  A,(c)  for  j  €  <d>  are  independent.  For  y  €  <d>  define  Z,(c)  A  max 

(A',(c)  ',A ,(c)  "')  and  let  Z0(F)  A  U'  K^CcY  ■  Then  the  variates  Z,(c)  for  ./  6  <d>  must  be 
independent.  Furthermore,  0  <  K,(c)‘  ,U  ^  Z,(c)  for  j  6  <</>.  Hence,  by  (1). 

(5)  0  <  <  n  z/(r)- 

/-« 


Note  that  £'(Z0(c))  exists  since  Z0(c)  is  lognormal.  Also,  for  /  €  <d>.  £'(Z,(<)) 
exists  since  A,(c)  is  lognormal  and  0  <  Z,(c)  <  A,(c)‘  +  A', Since  the  variates  Z,(cl 
for  /  €  <</>  are  independent  it  follows  from  (5)  that  £(r,‘  )  exists. 

By  1  and  II  we  conclude  £*' 1  ( v ( /*, ))  exists  for  all  e  e  A. 

_  ,/ 

(b)  Observe  8  f  /  iff  there  exists  /•  =  (r, . r ,) '  6  //  such  that  8  =  />""  +  £  r,£"’. 

_  /-i 

Also,  //  is  bounded  iff  H  is  bounded.  Thus,  /  is  bounded  iff  //  is  bounded. 
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A  CLASS  OF  CONTINUOUS  NONLINEAR  PROGRAMMING  PROBLEMS 
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ABSTRACT 

V  general  class  of  continuous  tune  nonlinear  problems  ts  considered 
Necessary  and  sulVicicnl  conditions  lor  the  existence  ol  solutions  are  esta¬ 
blished  and  optimal  solutions  arc  characlcri/ed  in  terms  of  a  duality  theorem 
I  he  theory  is  illustrated  h>  means  ol  an  example 


1.  INTRODUCTION 

Recently  Farr  and  Hanson  HI  proved  existence  theorems,  duality  theorems,  and  continu¬ 
ous  time  analogues  of  the  Kubn-Tucker  Theorem  for  a  class  of  continuous  time  programming 
problems  in  which  nonlinearity  appears  both  in  the  objective  function  and  in  the  constraints 
More  recently  this  class  was  extended  in  Fair  and  Hanson  1 2 ]  to  include  problems  with 
prescribed  time  lags  in  the  constraints.  In  this  paper  we  generalize  these  results  by  considering 
a  more  general  form  of  the  constraints  and  by  assuming  a  less  stringent  constraint  qualification 
This  constraint  qualification  is  analogous  to  that  of  Kuhn  and  Tucker  15 1  and  provides  further 
unification  between  the  areas  of  finite-dimensional  and  continuous  time  programming  An 
example  is  presented  wherein  these  results  arc  applied  to  a  version  of  Koopmans'  |4|  water 
storage  problem  which  has  been  modified  to  address  the  economic  ramifications  of  an  energy 
crisis. 

2.  THK  PRIMAL  PROBLKM 

I  he  problem  under  consideration  l Primal  Problem  A)  is: 

Maximi/e 

I  <r)  =  f  </>  <r (H.tUli 

J  n 


subject  to  the  constraints 


(1) 

.it)  >  0. 

n  ^  i  g  r. 

(2) 

/(:(/). t) 

<  hi  i  1  :,t).t) 

s7f 


574 


fW  Ki  ll  AND  AM)  M  A  HANSON 


and 

(3)  :(t)  =  0,  i  <  0. 

where  r  6/.,”  |0.  T\,  i.e.,  r  is  a  bounded  and  measurable  //-dimensional  function;  y  is  a  map¬ 
ping  from  L“  10,  T]  x  (0.  7"]  into  Ep  defined  by 

(4)  i  (-f)  =  £  J  Xi(:(s  —  «,).  s  -  a,)(h: 

i- a 

./ (r  (/),/).  h(y(:,t).i)  6  £"';  £, (r  (s  —  a,),  s  — a,)€  £'’.  /  =  0 . r.  The  set  0  =  an  ^ 

a  i  ^  ...  <  a,  is  a  finite  collection  of  nonnegative  numbers,  and  6  is  a  scalar  function,  concave 
and  continuously  differentiable  in  its  first  argument  throughout  |0,  7  ). 

It  is  further  assumed  that  each  component  of—/,  and  h  is  a  scalar  function,  concave 
and  differentiable  in  its  first  argument  throughout  |0,  £],  that  each  component  of  the  composite 
function  h(y(.i),i):L^{0,T]  — *  £"'  is  concave  in  r,  there  existsS  >  0  such  that 

(5)  either  VA/(r)./)  =  0  or  Va/(t),/)  >  8, 
and  for  each  t  and  A  there  exists  /A  =  /A  (/)  such  that 

(6)  VA/t  (r,y)  8, 

where 

Va/,(tj./)  =  df,(r).i)/d  t)a. 

/  =  1 . ///,  A  =  1 . //, 

for  r)  €  /:",  r;  ^  0.  and  /  €  |(),  £); 


(7) 

K,(:  (/)./)  =  0.  i  <  0, 

7-0 . r; 

(8) 

V/,//,(i>,/)  =  dh,(i’,i)/d ^ 

(). 

lor  i- 

€  f: 7  and  /  €  [0. 7*1;  and 

(9a) 

sup  //,(().£)  <  sup  V 

\  It,  (0,1 )  <  oo,  /=  1 . ///. 

IK.  IK  1  IK.  '•  1 

A  =  1 . //, 

(9b) 

sup  <00,  sup 

U«  f  /  •»•■•/ 

V**„(0./>  <  oo.  /  =  o.  ... 

q  —  1 . p.  k  =  \ . 

n. 

(9c) 

inf  /,(()./ )  >  /  =  1 , 

....  /w. 

O'  f  / 

(9d) 

sup  VA(/>  (T),/>  <  oo,  r)  € 

£".  r,  3?  0.  A  =  1 . //. 

ii-  ;•  / 


A  function  r  f.  /.„'  |(),  7  |  is  termed  feasible  for  Primal  Problem  A  if  it  satisfies  the  con¬ 
straints  ( 1 ).  (2).  and  (3)  The  primal  problem  is  itself  said  to  be  feasible  if  a  feasible  r  exists 

It  should  be  noted  that  Primal  Problem  A  is  identical  to  that  considered  in  1 2 1  it  />  =  m 

and 

hiy  (:./>./)  =  /,„  yiz.t) 
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where  lm  is  an  w-dimensional  identity  matrix. 

3.  EXISTENCE  THEOREM 

THEOREM  1:  If  Primal  Problem  A  is  feasible,  then  it  has  an  optima)  solution,  that  is, 
there  exists  a  feasible  e  for  which 

F(:r)  =  sup  F(r). 

where  the  supremum  is  taken  over  all  feasible  c. 

We  preface  the  proof  of  this  theorem  with  a  brief  discussion  of  weak  convergence  and  two 
lemmas. 

Let  X  be  a  normed  linear  space  and  denote  by  X *  the  collection  of  all  bounded  linear 
functionals  on  X.  If  we  define  the  norm  of  an  element  /  €  X*  by 

11/11=  sup  |/Lv)| 

lull- i 

and  define  addition  and  scalar  multiplication  of  linear  functionals  in  the  obvious  manner,  then 
X*  is  a  Banach  space  and  is  commonly  referred  to  as  the  dual  space  of  X.  A  sequence  |.v„)  in  A 
is  said  to  converge  weakly  to  x  6  .V  if  ,/  (.v„)  —  /(a  )  as  n  -<»,  for  every  /  €  X*. 

LEMMA  1:  Let  the  uniformly  bounded  sequence  of  scalar  measurable  functions  (</,/<  r ) | , 
d  =  1,2 . converge  weakly  on  [0,7")  to  t/()(  / ) .  Then  except  on  a  set  of  measure  zero 

qnit)  ^  lim  sup  Qjii). 

PROOF:  See  Levinson  161 

LEMMA  2:  If  q  is  a  nonnegative  integrablc  function  for  which  there  exists  scalar  con¬ 
stants  U\  ^  0  and!/  >  0  such  that 

q(ti  ^  #,  +  ft 2  f  q(s)ds.  0  ^  i  ^  T. 

then  qU)  ^  #|  c":'.  0  <  t  ^  I 

PROOF:  See  Levinson  (6| 

PROOF  OF  THEOREM  1:  L  et  r  be  feasible  for  Primal  Problem  A  and  multiply  the  con¬ 
straint  (2)  by  the  w-dimensional  vector  (1 . 1)  to  obtain  the  inequality 

nt  »i 

£  /,(;(/). r)  £  It, (v  (r, / ) ,/ ) .  0  ^  t  <  T. 

t=  I  »  =  1 

From  the  convexity  of  each  /,  in  its  first  argument,  if  follows  from  |8,  p  2421  that 

m  m  m 

£  /,<r(r).r)  >  £  ,/;co.n  +  £  «*<#»  -mo. 

/=l  i=l  A- 1 


ak(t)  -  £  V  k /,((),/). 


where 


$76 


T  W  REILAND  AND  M  A  HANSON 


Set  Wo  = 


max 


0. 


sup 
n.  10.  n 


£ ./;  (o./ ) 

i-i 


by  (9c)  and  observe  that  by  assumption  (6) 


A  =  inf  min  aAi)  >  0. 

i  k 

Since  r  is  feasible  and  therefore  satisfies  constraint  (l),  it  then  follows  that 

n  m 

(10)  A  £  ;*(/)  <  W„  +  £  0  ^  i  <  T. 

A-l  /-I 


Define 

lV.i>,  ( TJ ,  s )  I  =  |V  ;A’w(rj,s')|/lv,(,  for?)  €  E".  s  €  [0.  7'].  j  =  0 . r. 

and 

IV/r («■./)!  =  (V,/i, <i-. Ion-  €  FJ'.  i  €  [O.n. 

and  set 

(i(z.t.s)  -  £  /(.v)  I V /; (t  ( r. r ) , r ) ] a’, ( c ( a ) . .v ) 

( _  (|  l«.r  ->/l 

and 

//(.-, i, a)  -  £/  (a)  lV/;(r(r./).f)||Vj?J(r(s).s)] 
where  //  (■)  is  the  indicator  function  of  the  set  E 

Since  h  and  y,  are  concave  in  their  first  arguments  it  follows  from  |8)  and  from  (.1),  (7)  and  (8) 
that 

h(y(:,i).i)  /; (0, r )  +  f  (HO.i.s)ds  +  f  //«U.v)r(.v)«/s. 

0  0 


By  (9a)  and  (9b)  we  select  ^  0  and  >  0,  such  that 


sup 


m  w  p  t  I 

£  h,(0.t)  +  £  Jn  O', (O.r.s )<7a 


^  W, 


and 


sup  max 
* 


£  //,*(0./.a) 


<  w,. 


from  (10)  we  have  that  W*  =  (W„ +W|)\-1  and  w  *  =  W V A  arc  nonnegative  and  positive  con¬ 
stants.  respectively,  for  which 


£  ;k  <r )  <  W r  +  W 2  f,  £  <v )ds.  ()!?  K  7 

*-i  i-i 

f  ront  l  emma  2  we  conclude  that 

(ID  £:»</><  exp  <«tr)  Wfexp  <«?n.  0  ^  /  $  7. 


and  hence  the  set  of  feasible  solutions  for  Primal  Problem  A  is  uniformly  bounded  on  10.7  ) 


Since  <b  is  concave  and  differentiable  in  its  first  argument  throughout  |(). T|.  it  follows 
from  I9d>.  I X |  and  the  uniform  boundedness  property  that,  for  any  feasible  solutions  rand  r". 
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V(z)  —  f''(r°)  ^  T  V  sup  (zA(r)  —  rA"  ( / ) )  sup  VA<f>(r"(/ )./ )  < 

A-1  '  ' 

and  hence  V  is  bounded  above  for  all  feasible  z. 

Let  T  =  lub  L(r),  where  the  least  upper  bound  dub)  is  taken  over  all  feasible  r  Then 
there  exists  a  sequence  (c‘/l  of  feasible  solutions  such  that 

lim  f  ir1')  =  F. 

,/  -.-o 

Since  {ri  is  uniformly  bounded,  it  follows  from  1 10)  that  there  exists  a  r  to  which  a  subse¬ 
quence  of  Iri  converges  weakly  in  lO.Tl.  Denote  this  weakly  convergent  subsequence 
itself  by  lr‘V,  the  application  of  Lemma  1  to  each  component  of  z'1  then  provides  uniform 
boundedness  for  :  except  possibly  on  a  set  of  measure  zero  where,  as  will  be  shown  later,  it  can 
be  assumed  to  be  zero. 

Since  each  component  of  the  composite  function  //  (.v  (•,/>./)  is  concave  in  z.  it  follows 
from  (8 1 ,  (3)  and  the  chain  rule  for  differentiation  that 

h(v(:J,i).t)  sj  h(vCz.i).t)  +  f  fUz.i.s  UzJ(s )  -  cl.vDi/s,  0  <  /  <  I. 

J  U 

Since  each  entry  of  the  m  x  n  matrix  lliz.i. s)  is  bounded  and  measurable,  it  follows  that  each 
row  f/,iz.t.s)  Z  [0. 7)  C  (0. 7|  and  so,  bv  weak  convergence, 

f  //(;./..v)  L"'(.v)  -  :<v)W/.v  -  0,  as  </  -  «>. 

J  U 

Thus,  by  constraint  (2) 

(12)  lim  sup  Hz''(i),i)  ^  lil\  < r. r ) . r ) ,  0  C  t  ^  I. 

,i 

Define  (V/Itj.MI  =  <<Va,/,(t |.r)),„,„,  r/  f  r/  >  0;  by  the  convexity  of  / 

ft z‘'(r).i)  Js  /(z(i).r)  +  IV f  <:(/>. r)l  <;"'</>  -  r</>>.  I)  C  /  $  /. 

Therefore,  from  (12). 

(13)  /  (z  </>./>  ^  hiytz.i)./) 

except  on  a  set  of  measure  zero,  since  by  [Hi,  assumption  *5)  and  Lemma  I  we  have 
lim  sup  |V  /(;(/),/)  I  (rJ( / )  -  2(D)  7;  o 

,i  ... 

except  on  such  a  set 

A  second  application  of  Lemma  I  to  each  component  of  z'1  prov  ides 
-  5  ( r )  ^  lim  sup*  S'  0,  a.c.  in  |()./|, 

,i  ■  ■ 

and  consequently  z  is  nonnegative  except  on  a  set  of  measure  zero  front  this  result  and 
expression  (13).  we  observe  that  r  can  violate  the  constraints  ol  Primal  Problem  A  on.  at  most, 
a  set  of  measure  zero  in  10.  /1  We  define  r  to  be  zero  on  this  set  of  measure  zero,  as  well  as 
for  r  <  0.  and  equal  to  5  on  the  complement  of  this  set  I  he  feasibility  ol  .'  is  then  establislu.  d 
by  noting  that 

.1  <Z.I>  =  1  (Z.l).  0  <'  I  <  / 

lint  sup  t  < :‘'i I )./ 1  •  /  01. z  1 .  0  '  t  '  f 


and  that 
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bv  the  convexity  lstraint  (I ).  arui  assumption  (6). 

By  the  concavity  ami  tlifl'erenliahtlitv  of  </> 

J  <i>  (z‘'U).t)di  <  J  '  <t>  (:U).i)di  +  J  '  (c'(/)  -  3</))'  V<t>(:U).i)di. 

I  herefore.  by  weak  convergence 

r-  iini  f  ' 

.  “'ll 

£7  f  '  6l:in.i)Ji  =  l  ie). 

J  n 

By  the  definition  of  I  and  the  feasibility  of  3.  1(c)  ^  I',  thus  t(3)  =  l  and  3  is  an  optimal 
solution  for  Primal  Problem  A.  q  j;  d 


4.  VVKAk  1)1  AI.ITV 

Before  the  dual  to  Primal  Problem  A  is  formally  stated,  a  continuous  time  l.agrangian 
function  and  its  I  rechet  differential  will  be  introduced 

loi  u  ■  I  '  [0.  / 1  and  a  (  I  .,',  Id. /  I.  define 
U4)  I  in.  nt  f  |</>(i/ (r  )./)■<-  u '(/)/  ( //,r  )|(/r 

w  here 

I  i  II.:  i  h  ( i  in. I  >.t )  /((/(/)./).  0  <  I  $  /. 

and  let  <s  /  ln.it.  yi  denote  the  I  rechet  differential  |7|  with  respect  to  its  firs:  argument, 
evaluated  a'  u  with  die  increment  y  *  /  |0. 7 1  The  differentiability  of  each  of  the  functions 
involved  m  /  insures  that  the  I  reihei  differential  exists  and  allows  <5,/.  (//,«•;  y)  to  be  deter- 
mmevl  by  the  simple  differentiation 

I  I  5  )  b  ,/  (//.  it  .  y  •  Uli  *  ciy.it  >1,,  ii. 

till 

I  he  I  rechet  differential  has  two  addilion.il  properties  that  will  be  used  in  the  ensuing  discus¬ 
sion.  namely,  the  linearity  of  .5 ,  / .  t  u.  n  ,  y  i  in  ns  increment  y  and  the  continuity  of  ft,/,  hi.  u  ;y ) 
in  y  under  the  norm 

y  I,.'  ••  max  v. 

Here  denotes  the  essential  supremum  I*),  p  1121 

If  y  ( i )  ~  1)  for  i  <  0.  then  from  <I4)  we  have 

( I  b )  <■> :  /.  (  ii.  it  .  y  I  =  ^  |  |V</>  •  ii  ( t )  .t  >l'y  1 1 ) 

*  X  w  '<  r )  // (  //./.n  )y  ( v  ></v  -  It '</)  IV/ ((Hr ),/ llyW  )!<//. 

An  application  of  I  ubini  s  theorem  |‘>|  to  interchange  the  limits  of  integration  enables  us  to 
express  (lb)  as 

(  I  '  I  i5 1  /.  (  ii.  it  .  y  )  =  ft  I  (  ii  ,y  )  4  ^  y  •/)/*(  //.  w ./  )dl. 

where 

b  1  i  ii  . y  )  -  §  |V</>  •//(/)./  I )  y  ( !  hll 


jd 
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and 

(18)  F(u.w.r)  «  f  r  H'(u.s.i)*(s)ds  -  IV/<m (/ )./)]'  *U).  0  <  /  T 

With  this  notation  the  dual  of  Primal  Problem  A  will  be  shown  to  be: 

Dual  Problem  A: 

Minimize 


(19) 

6' (m.h)  = 

Hu. h  )  -  8|Z.  (m.h  ;//) 

subject  to  the  constraints 

(20) 

//(/),  n(r 

)  s*  o.  o  i  ^  r. 

(21) 

F*(m.  w.i) 

+  lV^(H</)./)|  <  0.  0  <  r  < 

(22) 

//(/)  =  0. 

i  <  0 

and 

(23) 

h  (r )  =  0. 

i  >  T. 

THEOREM  2  (Weak  Duality):  If  r  and  (m.h)  are  feasible  solutions  for  Primal  and  Dual 
Problems  A,  respectively,  then 

Hr)  <  G (m.h). 

PROOF:  By  the  concavity  of  <b  and  -/in  their  first  arguments  and  the  concavity  of  the 
composite  function  h(y(\i).i )  in  r  it  follows  that  L  is  concave  in  its  first  argument  and 

L(z.yv)  —  L(u,w)  ^  h\L(u.w,z  -  u). 

Thus, 

F(z)  -  G(u.w)  =  Hz.w)  -  f  w'U)F(z.i)dt 

J  0 

—  L(u.k)  +  ft\L(u,w,u) 

<  8 1 L  ( u.  h  ;r  -  m )  +  8 ,  L  (  m.  w  ,u  ) 

-  f  w'U)F(z.l)dt 
Jd  , 

=  8]Z,  (m.h  ;r)  —  w'(t)  Fiz,t)di 

by  the  linearity  of  the  Frechel  differential  in  its  increment.  By  (17)  we  have 

biHu,w\z)  —  f  w'{i)F(z.l)dl  =  f  z'(/){ [V<£  (u  (r  ),i  ))  +  /■*  (m,h,/  )}di 
1  •/  ()  0 

-  f  w'(i)Fiz.i)di 
Jo 

which  is  nonpositive  by  constraints  ( I ),  (2),  (20)  and  (21)  Q  l  D 


From  Theorem  2  it  is  observed  that  if  there  exist  feasible  solutions,  z  and  (m.h  ).  lor  the 
primal  and  dual  problems  and  if  the  corresponding  primal  and  dual  objective  function  values. 
Hr)  and  O'(m.h  ),  are  equal,  then  these  solutions  are  optimal  for  their  respective  problems 
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5.  THE  CONSTRAINT  QUALIFICATION. 

The  constraint  qualification  introduced  here  is  motivated  by  the  form  of  the  Kuhn-Tucker 
constraint  qualification  presented  by  Zangwill  [11]  and  also  by  Property  1  given  below.  The 
basic  theory  surrounding  this  qualification  is  established  to  provide  a  framework  for  the 
theorems  of  Section  6. 

PROPERTY  1:  If 

(24)  8k(r;y)  =  f T y'(t)[V<t>(z(i).t)]di  >  0 

J  0 

where  z,  y  €  Z.”  [0,7],  then  there  exists  a  scalar  <r  >  0  such  that 
V(z  +  Ty)  >  Viz),  for  0  <  t  <  cr. 

PROOF:  By  (15)  and  (24) 

lim  {V(z  +  Ty)  —  F(z)]/t  =  8  Viz\y)  >  0, 

r|0 

thus  a  positive  <r  can  be  chosen  which  is  sufficiently  small  so  that 

Viz  +  Ty)  >  Viz),  for  0  <  r  ^  cr.  Q.E.D. 

DEFINITION  1:  For  each  z  which  is  feasible  for  Primal  Problem  A,  define  Diz)  to  be 
the  set  of  n-vector  functions  y  for  which 

(i)  y  €  £„“  (0.71 

(ii)  y(r)  —  0,  for  /  <  0 

(iii)  there  exists  a  scalar  <r  >  0  such  that 

zit)  +  ry  it)  >  0.  0  <  i  T, 

and 

Fiz  +  Ty./)  ^  0.  0  <  /  <  T, 
for 

0  <  t  <  c r. 

DEFINITION  2:  Define  Diz)  to  be  the  closure  of  Diz)  under  the  norm  IHI“:  that  is,  if 
a  sequence  [y<<]  C  Diz)  is  such  that  ||yrf-  y  11^  ~ *  0,  as  t/  —  «>,  then  y  €  Diz) 

Henceforth,  the  Frechet  differentia!  of  the  mapping  Fi-.i):  L  "  [0,7]  —  Em  evaluated  at  z 
and  with  increment  y,  will  be  denoted  by  hFiz, y ), .  It  should  be  observed  that,  for  any 
specified  value  of  /  €  [0,7],  the  existence  of87(z;y),  is  ensured  by  the  differentiability  of  /, 
gj,  and  h  and  that  when  y(t)  -  0  for  /  <  0,  we  have 

(25)  87(z; y),  -  f '  Hiz.t.s)y  is)ds  -  [V/(z(/).f)]y(r). 

•7  0 

Similarly,  the  Frechet  differential  of  a  component  F,i-,t)  of  Fi-.t),  evaluated  at  z  with  incre 
ment  y,  will  be  denoted  by  87, (z;y),. 
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DEFINITION  3:  For  each  z  which  is  feasible  for  Primal  Problem  A  define  v*(z)  to  be  the 
set  of  //-vector  functions  y  for  which 

(i)  y  €  10,7*1, 

(ii)  y(/)  =  0,  for  r  <  0, 

(iii)  y/,(t)  ^  0  a.e.  in  7"u  (z ),  k  =  1  ...  ,  //, 

(iv)  bF,(z\y),  -5  0  a.e.  in  7\,(z),  i  =  1 . ///. 

where 

Tu(z)  =  1/  €  [0. ri:  z,,</)  =  0}.  k  —  1 . /; 

and 

T2i(z)  =  (f  €  [0.7']:  F,(z.i)  =  0!.  /  =  1 . 

In  a  comparison  of  the  sets  D(z)  and  i/(r )  with  their  finite-dimensional  counterparts 
presented  in  Zangwill  [II],  it  is  observed  that  D(z)  is  analogous  to  the  set  of  "feasible  direc¬ 
tions"  at  z  and  2(z)  is  analogous  to  that  set  of  directions  for  which  the  directional  derivatives 
of  each  of  the  active  constraints  at  r  are  nonnegative. 

PROPERTY  2:  D(z)  C  Q(z). 

PROOF:  Part  1.  Let  y  €  D(z).  Then  by  Definition  1.  there  exists  a  scalar  <r  >  0  such 
that  0  <  t  ^  <r  implies  z(r)  +  ry  (t )  ^  0,  0  <  t  ^  T.  Thus,  if  z*  (/  >  =  0.  then  y k  </ >  ^  0. 

Assume  that  F,(z.t )  =  0.  If  8/,(z;y),  <  0.  then  by  the  same  technique  used  in  the  proof 
of  Property  1.  it  follows  that  for  r  sufficiently  small. 

F,(z  +  ry.r)  <  F, (z.f)  =  0. 

This  result  contradicts  the  assumption  that  y  €  D(z)  and  therefore  we  conclude  that 
D(z)  C  z). 

Part  2.  Assume  that  there  is  a  y  €  L“  |0, T|  and  a  sequence  (yi  C  /)<;)  such  that  max 

n 

I  ly/  -  y  J  I00  —  0,  as  cl  —  «>.  Then  for  all  /  such  that  z*  (f )  =  0,  yl!(l)  0,  d  =  1 .  2 .  It 

then  follows  from  convergence  in  U°  |0, T]  that  y ^ ( / )  >  0  a.e.  on  Tu(z).  k  =  1 . n. 

Assume  there  exists  an  /  and  a  set  E  of  positive  measure  over  which  /-,  (z. / )  =  0  and 
S/-,(z; y),  <  0  for  all  /  €  F.  By  the  continuity  of  bF,(z\),  in  the  L”  norm  |7],  we  can  choose 
a  //*  sufficiently  large  such  that  for  J  >  d' 

f>F,(z.yJ),  <  0 

over  some  subset  of  E  which  has  positive  measure.  This  result  yields jt  contradiction  to  Part  1 
since  it  was  assumed  {y^}  C  D(z)  and  we  can  therefore  conclude  that  D(z)  c  <y(z).  g  j. 

DEFINITION  4  (Constraint  Qualification):  Primal  Problem  A  will  be  said  to  satisfy  the 
Constraint  Qualification  if  the  problem  is  feasible  and  if 
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where  z  is  an  optimal  solution  to  the  problem. 

In  more  general  problems  where  convexity  and  concavity  properties  are  not  assumed,  the 
purpose  of  the  Constraint  Qualification  would  be  to  eliminate  "cusps"  in  the  feasible  region. 
For  example,  the  constraints 

z,(r)  >  0,  z2(t)  ^  0,  0  <  t  ^  T, 

and 

[i  -  z^t)]3-  z2U)  >  o,  o  <  t  <  r, 

do  not  satisfy  the  Constraint  Qualification  when  z(t)  =  (1,0),  0  ^  t  ^  T,  since 

(1/2,0)  €  mi)  but  (1/2.0)  <7  D(z). 

In  problems  such  as  Primal  Problem  A  where  convexity  and  concavity  properties  are 
assumed,  violations  of  the  Constraint  Qualification  can  be  shown  to  arise  when  the  constraints 
take  the  form  of  equalities  on  some  set  of  positive  measure.  For  example,  consider  the  con¬ 
straints 

Z| (/)  >  0.  z2U)  >  0,  0  <  t  <  T. 
and 

[z,(r)  +  z2(i)  -  l]2  <  1  -  lE(t),  0  <  t  ^  T, 

where  £  is  a  set  of  positive  measure  in  [0,Tj  and  /£(  )  is  its  indicator  function.  It  is  observed 
that  for  Hr)  =  (1/2. 1/2),  we  have  (1.1)  6  3(z)  but  (1,1)  G  Z)(z),  thus  the  Constraint 
Qualification  is  not  satisfied 

THEOREM  3:  If  z  is  optimal  for  Primal  Problem  A,  then  under  the  Constraint 
Qualification 

8f(:,yl  0.  for  all  y  €  9(z). 


PROOF:  Part  1  Suppose  there  exists  a  y  €  D(z )  such  that  8  F(z;y)  >  0.  Then  by  Pro¬ 
perty  1  there  exists  a  <r  >  0  such  that  0  <  t  ^  a  implies  F(z  +  ry)  >  K(z);  however,  since 
y  €  D(z)  we  can  choost  a  (r0  sufficiently  small  so  that  z  -I-  cr0y  is  feasible  for  Primal  Problem 
A.  Thus,  by  contradiction  of  the  optimality  of  z,  we  can  conclude  that  8f(:;y)  <  0,  for  all 
y  €  D(z ). 

Part  2.  Let  (y'l  be  a  sequence  of  functions  in  D(z)  and  let  y°  be  such  that  max 
1 1 y*  -  y*  I 1“  —  0,  as  It  then  follows  from  Part  1  and  the  continuity  of  8  F(z;  )  that 

8F(?;y°)  -  limSKf^y-O  <  0. 

Thus,  8  K(z;y)  0  for  all  y  6  O(z).  Q.E.D. 

6.  DUALITY  AND  RELATED  THEOREMS 

In  proving  strong  duality  and  its  related  theorems  two  additional  assumptions  will  be 
made.  These  are: 

(26)  W(z.f.s)  Sf  0,  0  <  s  <  /  T 


and 

(27) 
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F{z,t)  -  8F(z;z),  >0,  0  <  /  <  T, 

where  z  is  an  optimal  solution  for  Primal  Problem  A.  It  will  be  shown  in  Corollary  1  that 
assumption  (26)  is  implied  if  z(t)  =  0  is  feasible. 

THEOREM  4  (Strong  Duality):  Under  the  Constraint  Qualification  and  assumptions  (26) 
and  (27),  there  exists  an  optimal  solution  (u,w)  for  Dual  Problem  A  such  that  u  =  z  and 
C(J,iv)=  F(z). 

Before  proving  Theorem  4  the  following  linearized  problem,  called  Primal  Problem  A',  will 
be  considered: 

Maximize 

6  F(z'z  —  z) 


subject  to  the  constraints 

(28)  z(r)  >  0.  0  <  t  <  T. 

(29)  F(z,t)  +  8F(I;z  -  z),  >  0,  0  <  t  ^  T, 
and 

(30)  zO)  =*  0,  for  /  <  0. 

LEMMA  3:  Under  the  Constraint  Qualification,  z  is  an  optimal  solution  for  Primal  Prob¬ 
lem  A'. 

PROOF:  If  z  is  feasible  for  Primal  Problem  A',  then 

z(t)  —  z(t)  ^  0,  for  t  €  7j*(z),  k  =  1,  ....  n, 
and 

6Fj(z,z  —  z),  0,  for  /  €  T2i(z ),  i  —  1 . m, 

and  therefore  (z  —  z)  €  !&(z).  It  then  follows  from  Theorem  3  that,  under  the  Constraint 
Qualification, 

8  V(z  \z  -  1)  <  0 

for  all  z  satisfying  (28),  (29)  and  (30).  The  optimality  of  z  follows  since  z  is  feasible  for  Primal 
Problem  A'  and  since  6  F(z;0)  -  0.  O  F  D 


PROOF  OF  THEOREM  4:  We  rewrite  Primal  Problem  A'  in  the  form 
maximize 

C  a'(t)z(t)dt 

J  o 


subject  to  the  constraints 

z  (/ )  ^  0,  0  <  t  <  T. 
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and 

Bit)zit)  <  cit)  +  fQ'  KU.s)z(s)ds,  0  <  t  <  T, 

where  a  it)  -  [V <£(?(/),/)],  Bit)  ~  IV fizir),t) ],  cit)  -  F(z,i)  -  8Fiz,z),,  and  K(t,s)  = 
Hiz,t,s).  From  assumptions  (5),  (6),  (26)  and  (27)  it  is  observed  that  the  matrices  a(r), 
Bit),  cir )  and  Kit.s)  satisfy  the  requirements  of  Grinold’s  Duality  Theorem  [3).  Therefore, 
there  exists  an  m- vector  function  w  satisfying 

(31)  wit)  >  0,  0  <  r  <  T, 
and 

(32)  B'it)w(t)  >  ait)  +  f  T  fCis.t)wis)ds,  0  <  t  <  T, 
such  that 

(33)  J"q  w'it)  cit)dt  =  J*Q  a'it)zit)dt. 

Setting  wit)  —  0  for  /  >  T,  we  observe  from  the  identities  (14),  (17),  and  (18)  that  expres¬ 
sions  (32)  and  (33)  can  be  expressed  as 

(320  F'iz.w.t)  +  [V0(z(/),/)l  <  0,  0  <  t  <  T, 

and 

(330  Liz,w)  -  f>\Liz,w,z)  =  Viz), 

respectively.  From  (31)  and  (320  and  the  fact  that  wit)  =  0  for  t  >  T,  it  then  follows  that 
iz.w)  is  feasible  for  Dual  Problem  A  and,  from  (19)  and  (330 

(34)  C7(z,iv)  =»  Viz). 

Finally,  by  the  weak  duality  established  in  Theorem  2,  it  is  concluded  from  (34)  that  (z,  vP)  is 
an  optimal  solution  for  Dual  Problem  A.  Q.E.D 

In  order  to  apply  Theorem  4  in  practice,  it  is  desirable  to  be  able  to  verify  conditions  (26) 
and  (27)  without  prior  knowledge  of  the  optimal  solution  z.  The  following  corollary  provides 
this  capability. 

COROLLARY  1:  If 

-  3g„(-n,f)/dT7*  >  0, 

(35)  j  =  0,  . . .  ,  r,  i  —  \,  ,  p,  k  -  1,  ....  n, 

for  t)  €  E",  rj  >  0,  and  0  ^  t  <  T ’ 

(36)  F(0,t)  >  0.  0  <  t  ^  T, 

then  under  the  Constraint  Qualification  there  exists  an  optimal  solution  (w,w)  for  Dual  Problem 
A  such  that  u  —  rand  Giz.w)  -  Viz). 

PROOF:  We  have  from  (8)  and  (35)  that 

Hil.t.s)  -  T  /  is)  W hiyiz,t),t)]  lVg,(z(s),s)]  >  0,  0  <  t  T. 

j- o  l°.r— *»yl 

and  by  (36)  and  the  concavity  of  F  that 

Fill)  -  8Fil\z),  >  FiO.t)  2  0,  0  <  t  <  T. 
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From  these  results  it  follows  that  the  conditions  of  Theorem  4  are  satisfied.  Q  F.D 

THEOREM  5  (Complementary  Slackness  Principle):  If  :  and  (r.iv)  are  optimal  solutions 
for  the  Primal  and  Dual  Problems  A,  then 

(37)  fT  w'(r)F(z.t)dr  =  0 
J  o 

and 

(38)  f  z'(t)[F' (z.w.t)  +  [V«Mz(/),/)]U  =  0. 

J  0 

PROOF:  Since  z(t)  >  0  and  F*(z,w,r)  -+■  lV(t(5(/),/ )]  <  0.  0  ^  t  4  T.  ii  follows  from 
identity  (17)  that 

r  T 

I  z'(t)\F*  (z.w.t)  +  |V<6  (z(t).t)]}dt  =  <5,/  if.  h  .r  I  €  0. 

v  0 

and  therefore,  by  (33') 

Llz.w)  —  F(z)  =  f  i v'(t)F(z,t)dt  4  0 
() 

Since  w(/i  0  and  Flz.t )  >  0,  0  ^  t  4  7,  it  also  follows  that 

(39)  w’(t)F(z.t)dt  >  0, 

J  0 

thus  the  equality  in  (37)  is  established 

Similarly,  (33')  and  >39)  imply  that 
h,Hz.w:z)  ^  0 
and  therefore,  by  (17) 

Jn'  z’(t)\F*  (z.w.t )  +  |V<7.(f(/),r)||Jr  >  0. 

Since  ?(r)  >  0  and  F*  (z.w.t )  +  fV</5(r 0).r)|  <7  0.  0  <  :  I  have 

/»  f 

(  f'(r)!A*(;,H./)  4-  |V<*l;Ol. /!)}(/?  «  I' 

and  Ihu.s  ttie  equality  in  (38/  is  established  sF‘- 


THEOREM  6  (Kuhn- f  ucker  Conditions)  Assume  lhai  t35»  and  <36'  are  satisfied  foi 
Primal  Problem  A.  Then  under  the  Constraint  Qualification  r  c  an  opiimal  solmion  if  and  onh 
if  there  exists  an  m-vector  function  w1  such  that 

(i)  F*  (z.w.t )  +  lV</><z(r).r)l  4  0.  0  4  r  4  7 

r  r 

(jil  I  z'(t)\F*  (z.w.t)  ■+■  W<b(z(t).t)\)dt  =  0 

J  0 

(iii*  f  wit) Flz.t) (it  -  0 

•'ll 

(jvi  w  it)  $  0,  0  ^  t  4  T and  wd)  =  0.  t  >  7 

PROOF: 

Necessity  The  necessitv  o!  the  condition',  loliow-  !•■  ■  -■  - *u..:  and  ’  i  -ni  '  suk 

:h‘.  n  veal":  tuni  non  M  o’  llie  optima;  ■.oliitior,  ■  .  m)I.ii<ii  '  ■  "ll'T  vl:-i'i'  > 
throuRtl  '  o  1 
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Sufficiency:  Let  z  be  feasible  for  Primal  Problem  A.  Then  since  Lis  concave 
L(z)  -  L(J)  ^  8  V (z;z  -  z) 

-  f{)  |z(/)-?(/)]'[V<Mz(r),i)]<//. 

Since  zit)  >  0,  0  ^  <  T,  it  follows  from  conditions  (i)  and  (ii)  that 

L(z)  -  L(z)  ^  -  f  (z(r)  —  z(t)Y F*(z,w,t)dt, 

and  by  (18),  (25)  and  Fubini's  Theorem  (9] 

J-  r  ri 

!;(/)-  z(t)Y  F'(z.w,i)dt  =  h'(/)87(z;z  -  :),dt. 

o  Jo 

By  (i),  (iii)  and  the  concavity  of  F, 

-  ['  w(t)f>F(l\z  -  :),dt  ^  -  ['  w'(t)[F(;.t)  -  FCz.rUdr 

=  -  f  \i'(r)F(z./)dr 
J  ii 

which  is  nonpositive  since  wit)  ^  0  and  /  (z,/)  ^  0,  0  ^  ^  7  Thus,  fir)  ^  f(z)  and  z  is 

an  optimal  solution  for  Primal  Problem  A.  q  I  D 

7.  EXAMPLE  -  WATER  STORAGE  PROBLEM 

In  the  water  storage  problem  posed  in  [4|,  the  hydroelectric  company  incurred  a  penalty  il 
it  could  not  meet  a  prescribed  demand  for  power  This  penally  was  characterized  in  the  objec¬ 
tive  function 

tkiDii)  -  Pit))dt 

where  [0.71  represents  a  planning  period  of  specified  duration.  Dit)  is  the  demand  rate.  Fit) 
is  the  production  rate  of  hydroelectric  power,  and  <//  is  the  penalty  function  which  was  assumed 
to  be  strictly  convex.  The  imposition  of  such  a  penalty  favors  the  consumer  or  a  middleman 
utility  company  which  retails  electric  power  to  the  consumers  In  short,  it  characterizes  a 
"buyers  market." 

If  there  is,  in  fact,  a  pending  energy  crisis,  it  seems  appropriate  to  consider  a  "sellers 
market"  where  the  demand  for  power  exceeds  production  capacity  and  a  premium  is  paid  to  ihe 
hydroelectric  company  for  any  power  which  it  produces  beyond  some  prescribed  level  In  the 
case  where  the  hydroelectric  company  is  supplying  power  directly  to  the  consumer,  these  premi¬ 
ums  may  take  the  form  of  increasing  prices  per  unit  beyond  some  allotment  level  When  the 
hydroelectric  company  is  supplying  a  middleman,  the  premiums  may  represent  an  incentive  pol¬ 
icy  which  encourages  maximum  production  during  peak  demand  periods 

The  premiums  to  the  hydroelectric  company  will  be  represented  by 

f  '  t r  (Pit)  -  Ait))dt 

Jo 

where  (0.71  represents  the  planning  period.  Pit)  is  the  power  production  rate,  ,-f  < / )  is  the 
prescribed  aggregate  allotment  or  incentive  level,  and  it  is  the  premium  function  which  is 
assumed  to  be  differentiable  and  concave  with  a  positive  slope  at  zero. 


For  the  dynamics  of  the  problem,  we  assume  a  confluent  system  of  rivers  supplying  water 
to  a  hydroelectric  plant  on  the  main  stream  with  r  of  its  tributaries  also  having  their  own 
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hydroelectric  plums  The  variables  and  parameters  which  relate  to  the  dam.  reservoir  and  plant 
on  the  main  stream  will  be  subscripted  by  0,  and  those  lor  the  r  dammed  tributaries  by  /. 
/  =  I . r 

We  let  U  ,  denote  the  initial  store  of  water  in  reservoir  ./and  W,  the  capacity  of  reservoir  / 
The  rate  of  spillage  and  rate  of  discharge  through  ihe  turbines  of  dam  /  at  time  t  are  denoted  by 
sir  I  and  </,(/).  respectively.  The  rates  of  inflow  of  water  into  the  reservoirs  on  the  dammed 
tributaries  are  £,(/)  ./  =  1 . r,  and  that  into  the  main  reservoir  from  its  undammed  tribu¬ 

taries  is  £,>(r). 

It  is  assumed  that  it  takes  <»,,  /  =  I . r  units  of  lime  for  the  water  released  from  dam 

/  to  reach  the  main  reservoir  and  that  there  is  no  spillage  or  discharge  through  the  dams  on  the 
tributaries  for  at  least  a  units  of  time  prior  to  the  start  of  the  planning  period,  where 
o  =  max  (or,  |  The  store  of  water  in  reservoir  j  at  time  t  can  then  be  expressed  as 

H',(r)  =  n  +  f  If  ,(/')-  ,v,(/')  -  d,(r'))di‘ 

Jo  ■  ' 

tor  /  -  I . r,  and 

=  «„  +  f 

0 

for  the  main  reservoir 

The  power  production  rate  for  a  given  rate  of  discharge  d  is  assumed  to  be  proportional  to 
d.  In  |4|,  it  was  necessary  to  assume  the  factor  of  proportionality  to  be  unity.  Here  we  allow 
this  factor  to  be  proportional  to  the  head  of  water  in  the  reservoir,  an  assumption  which  is  con¬ 
sistent  with  constant  turbine  efficiency.  The  head  is  the  difference  h  between  the  surface  level 
of  the  reservoir  and  the  tailwaters  below  the  dam  and  is  therefore  dependent  primarily  upon  the 
store  of  water  W  in  the  reservoir. 

The  relationship  between  /;,(/).  the  head  of  reservoir  j,  and  W/,(/)  will  be  represented  by 
h,[t)  —  /?*  (W'(r)),  where  h *  is  an  increasing  concave  differentiable  function.  The  functions 
h‘  owe  their  concavity  to  the  shapes  of  the  reservoirs  which  are  assumed  to  yield  a  continu¬ 
ously  disproportionate  increase  in  reservoir  surface  area  as  the  store  of  water  increases.  The 
production  rate  for  the  yth  hydroelectric  plant  is  then  expressible  as 

P,U)  -  d,(r)°  h'  (W ,(!)), 

in  which  case  the  production  rate  for  the  entire  system  becomes 

Pit)  -  £  />,</). 

,-n 


( I  ’)  —  St)(  I ')  —  dt)( ! ')  +  (*,  ( t  ~  o,l  +  d,(l  —  a  ( 1 1 


dr’ 


Assuming  the  role  of  the  hydroelectric  company,  we  want  to  select  our  water  storage  pol¬ 
icy  ts.d)  so  as  to  maximize  the  premium  payments  over  the  planning  period  This  problem 
takes  the  form 

maximize 

nts.d)  —  f  ntP(l)  ~  A(t))dt 

J II 
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subject  to 

0  sj  s,(r)  ^ 

0  <  </,(/)  «S  4>, 

0  ^  W,(i)  ^  0, 

j  —  0 . r ,  and 

.4  (/)  Pit) 

tor  0  ^  t  ^  T,  where  is  the  maximum  allowable  spillage  rate  through  dam  /  and  <l>  is  the 
turbine  capacity  of  plant  j. 

Through  proper  association  of  the  terms  of  this  model  with  those  of  Primal  Problem  A  it 
can  be  shown  through  application  of  Theorem  1  that  feasibility  ensures  the  existence  of  an 
optimal  water  storage  policy  which  will  maximize  the  total  premium  payment 
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ABSTRACT 

I  his  paper  considers  the  classical  linile  linear  transportation  Problem  III  and 
two  relaxations.  (Ill  and  (111),  of  it  based  on  papers  bv  Kantorovich  and  Rubin¬ 
stein.  and  Kretschmer  Pseudo-metric  type  conditions  on  the  cost  matrix  are 
given  under  which  Problems  (II  and  (II)  have  common  optimal  value,  and  a 
proper  subset  of  these  conditions  is  sufficient  for  Problems  (III  and  (III)  to 
have  common  optimal  value  The  relationships  between  the  three  problems 
provide  a  proof  of  Kantorovich's  original  characterization  of  optimal  solutions 
to  the  standard  transportation  problem  having  as  many  origins  as  destinations 
The  results  are  extended  to  problems  having  cost  matrices  which  are  nonnega¬ 
tive  row-column  equivalent 


1.  INTRODUCTION  WITH  PROBLEM  SETTING 

Over  25  years  ago  Kantorovich  in  his  classic  paper.  "On  the  translocation  of  masses"  [4], 
formulated  generalized  transportation  problems  which  arc  continuous  analogs  of  the  well-known 
transportation  problem  in  the  theory  of  finite  linear  programming.  He  raised  the  question  of 
characterizing  optimal  solutions  to  those  problems  whose  finite  dimensional  versions  have  the 
same  number  of  origins  as  destinations.  As  is  well  known,  optimal  solutions  to  the  standard 
finite  dimensional  transportation  problem  having  "m"  origins  and  "n“  destinations  are  charac¬ 
terized  by  means  of  a  system  of  linear  inequalities  involving  m  row  numbers  and  n  column 
numbers  which  together  comprise  a  feasible  list  of  dual  variables. 

Within  the  finite  dimensional  context  m  =  n.  Kantorovich's  goal  was  to  use  only  n 
numbers  in  a  linear  inequality  system  characterization  of  an  optimal  solution  rather  than  the 
standard  2n  (row  plus  column)  numbers.  In  order  to  accomplish  this,  three  conditions  defining 
a  pseudo-metric  were  imposed  on  the  cost  coefficient  matrix  Actually,  the  triangle  inequality 
condition  on  unit  costs  is  what  (iomory  and  Hu  later  termed  "reasonable  costs"  in  their  network 

“The  work  of  the  lirsl  author  was  supported  in  pari  by  National  Science  Inundation  (irants  I  MiKi-OMRI  anil 
FN(i78-25488 
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studies  |3).  Section  2.  Violation  of  this  particular  condition  is  also  related  to  the  "more  for  less" 
paradox  in  the  transportation  model,  see  Ryan  (7). 

The  original  application  of  the  pseudo-metric  conditions  involved  subtleties  which  were 
later  clarified  in  Kanlorovich-Rubinstein  (5]  but  for  a  transformed  version  of  the  standard  tran¬ 
sportation  problem,  which  we  state  as  Problem  111  in  the  next  section  In  attempting  to  give  a 
proof  of  Kantorovich's  characterization,  Kretschmer  [6]  introduced  yet  another  transformation 
of  the  standard  problem,  which  we  shall  term  Problem  II  in  the  next  section. 

The  basic  purpose  of  this  paper  is  to  delineate  the  key  relationships  between  these  three 
problems:  the  standard  transportation  Problem  1,  the  Kretschmer  transformed  Problem  II.  and 
the  Kanlorovich-Rubinstein  Problem  111.  The  results  we  obtain  depend  on  how  the  three 
pseudo-metric  cost  conditions,  denoted  (C.l)  through  (C',3)  in  Sections  3  and  4,  are  coupled 
together. 

Our  main  application  is  to  obtain  a  proof  of  the  originally  sought  for  characterization  of 
optimal  solutions  of  the  standard  transportation  problem  where  the  number  of  origins  equals 
the  number  of  destinations.  We  are  not  prepared  at  this  time  however  to  state  that  we  have 
industrial  or  public  sector  applications  of  the  type  II  or  type  III  transportation  models. 

2.  THE  KANTOROVICH-RUBINSTEIN  AND  KRETSCHMER  TRANSFORMS 
OF  THE  STANDARD  TRANSPORTATION  PROBLEM 

Let  c,;,  a,  and  b,(/  =  1 . n\  j  =  1 . n)  be  nonnegative  real  numbers  and  assume 

that  a,  and  b,  satisfy 

(I  D  X  a,  =  £  b,  >  0. 

*-i  ,-i 

The  original  transportation  problem  may  be  expressed  as  follows: 

(I)  Determine  the  minimum  value  M  of 

(12)  ZL‘</.v„ 

/-i  /-i 

subject  to  the  condition  that  x„  are  nonnegative  and 
(1.3)  £  xh  =  a<  *'  =  1 . «). 

i-i 

n 

Z-v,  -  0=1 . «> 

i- 1 

Let  us  consider  the  following  transportation  problems  which  were  studied  in  151  and  |6|: 


(II)  Determine  the  minimum  value  IV  of 


Z  L  ^  .K/) 

,-i  ,-i 


subject  to  the  condition  that  v,;  and  >,,  are  nonnegative  and 
(1.5)  £  (x,,  -  y„)  -  a,  (i  —  I . n). 

t- 1 

£  (x„  -  v,,)  -  h,  (./-l . n  ). 

.-i 
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(111)  Determine  the  minimum  value  V  of 

(i-6)  tiw 

/-i  j- 1 

subject  to  the  condition  that  ztJ  are  nonnegative  and 
(1.7)  £  2y  ~  X  ZJ>  =  a<  ~  b‘  =  *-  •  •  •  -  »)- 

J- 1  i- 1 

Program  1  of  course  is  the  classical  transportation  problem  which  may  be  solved  by  the 
well-known  row  and  column  number  method  (I1},12))  and  other  more  modern,  large  scale  pro¬ 
gramming  methods.  The  row  and  column  number  method  easily  extends  to  solving  Program  11. 
On  the  other  hand,  the  structural  matrix  of  Program  HI  is  a  network  incidence  matrix,  and  so 
III  is  an  uncapacitated  network  problem. 

It  is  clear  that  V  <  M  and  N  <  M  and  in  this  sense  Problems  11  and  III  are  relaxations  of 
Problem  I.  We  shall  study  when  one  of  the  equalities  V  =  /V,  V  =  A/,  and  M  —  N  holds. 

3.  THE  EQUALITY  N  -  V  OF  PROBLEMS  11  AND  Ill 

First  we  have 

LEMMA  1:  The  inequality  V  <  N  holds  if  the  following  condition  is  fulfilled: 

(C.l)  cu  «  c,,  for  all  /  and  j. 

PROOF:  There  exists  an  optimal  solution  xtJ  and  y,j  of  Problem  (II),  i.e.,  x,,  and  ytJ  are 
nonnegative  and  satisfy  (1.5)  and 

JV-il  cu  (x,j  +  yu). 

-I  j- 1 

Then 

(£*</  +  iy,<  -  +  Z>d  -  a>~  b<- 

U-t  /- 1  \j- 1  j- i  } 

Taking  —  xtJ  +  yjt,  we  see  that  z(j  are  nonnegative  and  satisfy  (1.6),  so  that  by  condition 

(C.l) 

V*ttwN- 

'-1 1- 1 

THEOREM  1:  The  equality  V  -  N  holds  if  condition  (C.l)  and  the  following  condition 
are  fulfilled: 

(C.2)  crt  -  0  for  all  /. 

PROOF:  There  exists  an  optimal  solution  zt)  of  Problem  (III),  i.e.,  zt)  are  nonnegative 
and  satisfy  (1.7)  and 

r-tic,v 

/-i /-i 
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Then 

£  zu  +  b,  =  £  Zj,  +  a,  =  d,  >  0. 

/-I  7-1 

Let  us  take  xy  =  0  if  i  ^  y  and  x„  =  4  and  put  v„  =  z,,.  Then  xy  and  y0  are  nonnegative  and 
satisfy  (1.5),  so  that 

^<11  C,<*y  +  y,j  )=il  t'y’„  =  T 

I- 1  /-I  1-1  1-1 

by  conditions  (C  D  and  (C.2). 

We  show  by  an  example  that  the  equality  N  =  L  does  not  hold  in  general  if  we  omit  con¬ 
dition  (C.2). 

EXAMPLE  1:  Let  n  =  2  and  take 

t'll  =  ‘‘22  =•■  t’i2  =  4  21  =  2, 

Oi  =  1,  aj=2,  =  2,  b\  =  1. 

Then  we  easily  see  that  V  =  2  and  Af  =  /V  =  4. 

4.  THE  EQUALITY  M  =  N  OF  PROBLEMS  I  AND  II 

Next  we  show  that  the  equality  M  =  N  does  not  hold  in  general  even  if  both  conditions 
(C.l)  and  (C.2)  are  fulfilled. 


EXAMPLE  2:  Let  n  =  3  and  take 

C,,  =  £*22  =  C „  =  0.  t*,2  =  £*21  =  20. 

C|J  ~  £*.M  =  £*2.1  =  <'.12  =  *  • 

o )  =  3/2.  flj=  1/2.  £/.,=  1/4, 


*!  =  *2  =  >•  *3“  1/4- 

By  special  methods  of  linear  programming  (see,  for  instance  [1]),  we  see  that  M  =*  ~  and  an 

optimal  solution  of  Problem  (I)  is  given  by  xu  =  1,  x22  -  1/2  x,2  =  -  xy}  -  1/4  and 

•*2i  ”  *ii  “  *2i  ”  *33  ”  0.  We  have  N  —  1.  An  optimal  solution  of  Problem  (II)  is  given  by 


*ii  ”  L  *22  “  1/2,  *ii  ”  Xi2  “  1/2,  x 1 2  ”  *21  —  X21  —  x,|  “  Xu  ”  0,  yn  —  1/4  and  >*„  ~  0  if 
U.j)  *  (3.3). 


Our  main  result  is  the  following  one 


THEOREM  2:  The  equality  M  —  N  holds  if  the  following  condition  is  fulfilled: 
(C  3)  f„  c„  +  t'pj  -I-  Cpq  for  all  i,  j.  p,  q. 


PROOF:  There  exists  an  optimal  solution  x„  and  y„  of  Problem  (II).  In  case 
z,j  -  x()  -  y,j  is  nonnegative  for  each  i.  j ,  we  sec  that  zu  is  a  feasible  solution  of  Problem  (1).  so 
that 

M  <  £  £  C>J  Z<>  £  L  L  +  T,y)  -  N- 

-I  /- 1  I—  I  /- 1 
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We  consider  the  case  where  some  xu  -  yu  are  negative.  We  may  assume  that  min(x„,  y0)  -  0 
for  all  /  and  j.  There  exist  p  and  q  such  that  0  ->  Then  we  have  by  (1.5) 

n  n 

£  ^  k  anc*  X  xpj  ^  y![x] . 

<-l  y-l 

Let  us  define  A,,  Bj  and  dtJ  by 


A„  -  B,  -  0, 

n 

n 

A/  “  X^ypq!  X  Xiq 

0  *  p).  Bj  -  x^yj  £ 

l— 1 

y-l 

Then 

4y  “  A,Bjl y^. 

n 

n 

(4.1) 

£  4y  “  <  ** 

r  £4/“  Bj  <xpj, 

y-l 

/-I 

(4.2) 

n  n 

£4-£*y- 

ypq- 

We  define  x,y 

(-1  y-l 

and  y,y  by 

(4.3) 

xij  xo  A-  d,j 

if  i  ^  p  and  j  ^  q , 

xpj  ”  —  ®y 

if  7  *  <?, 

Xk,  *  x-^  Aj 

if  /  ^  p. 

yij  ~  y» 

if  /  ^  p  or  y  ^  <7, 

& 

1 

sf- 

« 

© 

Then  xij  and  y,'  are  nonnegative  and  satisfy  (1.5)  and 

+  y',j) 

i- 1  ;-i 

n  n  n  n 

"XX  cij(xu  A"  A-  £  £  4ylcy  —  c«7  ~  cn  ~  cpj^ 
i- 1  y-l  i— I  y— 1 

<X  X  cn  +  yu)  -  h 

/-I  y-l 

by  condition  (C.3).  Repeating  the  above  procedure  (4.3)  a  finite  number  of  times,  t  we  obtain 
x*  which  are  nonnegative  and  satisfy  (1.3)  and 

W<XX  cux0  SXX  c<M>j  +  y<?  “  ^ 

/-I  y-l  /-I  y-l 

Hence,  M  -  N. 

THEOREM  3:  Let  k0,  f,  and  g,  be  nonnegative  numbers  and  assume  that  condition 
(C.3)  holds  for  ktJ  instead  of  ctJ.  If  ctJ  -  k„  4-  /,  +  gJy  then  M  -  N. 


'This  number  is  it  most  the  number  of  | y,,  >  0) 
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PROOF:  Denote  by  Mik)  and  Nik)  the  values  of  Problems  (1)  and  (ID  respectively  if 
c„  are  replaced  by  k,r  Then  we  have  M  =  M (A  )  +  Q*  with 

O*  -  Z  fa,  +  Z  gjbj. 

<-t  j-i 

Let  x,j  and.v,,  be  nonnegative  and  satisfy  (1.5).  Then 

Z  Z  c>iLxn  +V“Zt  k>M, +  >’</  >  + 

i-li-l  i-ly-l 


n 

„  ■ 

n 

n 

Z  /. 

a,  +  Z  y.j 

+  Z «/ 

b,  +  Z  y,j 

/-I 

i-i 

1-1 

i-l 

>  Z  Z  ^Ti  +  Tn'  +  >  Nik)  +  C,K. 

«-l  7-1 

Thus,  /V  >  Nik)  +  ClK.  Since  Nik)  -  Mik )  by  Theorem  2,  we  have  A  >  Mik)  + 
-  A/,  and  hence  M  =  V 

As  is  well  known,  two  transportation  problems  of  type  (I)  with  costs  (c„|  and 
(c„  +  f,  +  #,l  respectively,  are  equivalent  for  any  list  of  real  numbers  (./)},  |#y).  /  =  1,  ....  m, 

j  =  1 . n.  The  following  example  shows  that  the  nonnegativity  of  all  the  ./,  and  ,e,  is 

required  in  Theorem  3. 

EXAMPLE  3:  Let  n  —  2  and  take  ku  —  0,  ku  “  1/2,  k2|  =  1/2,  A;22  —  0,  o,  —  I,  a,  —  I, 
6,  -  1/2,  62  -  3/2,  /,  =  1,  f2  =  -5/2,  =  2,  g2  -  7/2.  Then,  Mik)  =  Nik)  =  1/4  while 
/V  -  9/2  <  5  -  M 

5.  KANTOROVICH’S  THEOREM  FOR  PROBLEM  (I) 

The  finite  version  of  Kantorovich's  Theorem  [41  can  be  written  as  follows: 

A  feasible  solution  xtJ  of  Problem  (I)  is  an  optimal  solution  if  and  only  if  there  exist 
numbers  u,  such  that 

(5.1)  |w,  -  a,|  <  c„  for  each  i,  j, 

(5.2)  u,  —  u,  —  e,y  if  x„  >  0. 


We  show  that  this  theorem  is  not  valid  as  it  stands.  In  fact,  let  us  recall  Example  2  and 
let  x„  be  the  optimal  solution  obtained  there.  If  there  exist  numbers  u,  which  satisfy  (5.1)  and 
(5  2),  then  we  must  have 


u  j  —  m2  =  C|2  —  20. 
Ml  ~  M2  =  fjj  =  I, 
M j  ~  M,  =  C|,  -  1 


This  is  impossible. 

In  order  to  give  another  proof  of  Kantorovich's  Theorem.  Kretschmer  considered  Prob¬ 
lem  (ID  and  asserted  N  -  M  without  any  assumption.  Notice  that  N  <  M  in  Example  2 
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Kantorovich's  Theorem  was  amended  by  Kantorovich  and  Rubinstein  (5;  Theorem  3]  in 
the  following  form: 

THEOREM  4:  Assume  that  conditions  (C.l),  (C.2)  and  (C.3)  hold  Then  a  feasible 
solution  ,v„  of  Problem  (III)  is  an  optimal  solution  if  and  only  if  there  exist  numbers  //,  which 
satisfy  (5.1)  and  (5.2). 

Under  conditions  (C.l)  and  (C.2),  the  dual  problems  of  Problems  (II)  and  (III)  coincide 
and  Theorem  4  is  an  immediate  consequence  of  the  well-known  duality  theorem  applied  to 
Problem  (II).  Thus,  condition  (C.3)  can  be  omitted  in  Theorem  4. 

Notice  that  conditions  (C.l),  (C.2)  and  (C.3)  hold  if  and  only  if  the  cost  <•„  is  a  pseudo¬ 
metric,  i.e.,  e„  satisfies  conditions  (C.l)  and  (C.2)  and  the  following  condition 

(C.4)  c,j  <  c,*  +  ckl  for  all  /.  j.  k. 

With  the  aid  of  Theorems  2  and  4,  we  have 

THEOREM  5:  Assume  that  conditions  (C.l),  (C.2)  and  (C.3)  hold  Then  a  feasible 
solution  xtJ  of  Problem  (I)  is  an  optimal  solution  if  and  only  if  there  exist  numbers  u,  which 
satisfy  (5. 1 )  and  (5.2). 
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ABSTRACT 

A  deterministic  capacity  expansion  model  lor  two  facility  types  with  a  finite 
number  of  discrete  time  periods  is  described.  The  model  generalizes  previous 
work  by  allowing  for  capacity  disposals,  in  addition  to  capacity  expansions  and 
conversions  from  one  facility  type  to  the  other  Furthermore,  shortages  of 
capacity  are  allowed  and  upper  bounds  on  both  shortages  and  idle  capacities  can 
be  imposed  The  demand  increments  for  additional  capacity  of  any  type  in  any 
time  period  can  be  negative.  All  cost  functions  are  assumed  to  be  piecewise, 
concave  and  nondecreasing  away  from  zero.  The  model  is  formulated  as  a 
shortest  path  problem  for  an  acyclic  network,  and  an  efficient  search  procedure 
is  developed  to  determine  the  costs  associated  with  the  links  of  this  network 


INTRODUCTION 

In  a  previous  paper  [9],  we  described  a  deterministic  capacity  expansion  model  for  two 
facility  types.  The  model  has  a  finite  number  of  discrete  time  periods  with  known  demands  for 
each  of  the  two  facilities  in  any  period.  At  the  beginning  of  each  period,  facility  /'(/'  =  1,2) 
may  be  expanded  either  by  new  construction  or  by  converting  idle  capacity  of  one  facility  to 
accommodate  the  demand  for  the  other  facility. 

In  this  paper,  we  extend  our  previous  work  by  allowing  for  the  reduction  of  facility  size 
through  capacity  disposals.  Furthermore,  shortages  of  capacity  are  allowed  and  upper  bounds 
on  idle  capacities  and  shortages  may  be  imposed.  These  generalizations  allow  us  to  deal  with 
more  realistic  situations.  Capacity  disposals  are  often  initiated  due  to  high  holding  cost  of  idle 
capacity  when  the  cumulative  demand  decreases  over  some  successive  periods.  Capacity  shor¬ 
tages  may  be  attractive  when  capacity  may  be  temporarily  rented  or  imported  from  other 
sources.  Also,  in  some  applications  it  may  be  economical  to  permit  temporary  shortages  and 
pay  a  penalty  for  unsatisfied  demand,  rather  than  expanding  the  facilities  at  that  time.  Finally, 
upper  bounds  on  idle  capacity  and  shortages  are  usually  imposed  by  management. 

The  costs  incurred  include  those  for  construction  of  new  capacity,  disposal  of  existing 
capacity,  conversion,  holding  of  idle  capacity,  and  for  having  capacity  shortages.  As  in  |9], 
conversion  implies  physical  modification  so  that  the  converted  capacity  becomes  an  integral  part 
of  the  new  facility  and  is  not  reconverted  automatically  at  the  end  of  the  period.  The  capacity 
expansion  policy  consists  of  timing  and  sizing  decisions  for  new  constructions,  disposals,  and 
conversions  so  that  the  total  costs  are  minimized. 
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The  model  is  useful  for  communication  network  applications,  such  as  the  cable  sizing 
problems  examined  in  [9],  Suppose  the  demands  for  two  cable  types  is  known  for  the  next  T 
periods.  Furthermore,  suppose  the  more  expensive  cable  can  accommodate  both  demand 
types,  whereas  the  cheaper  cable  can  be  used  only  to  satisfy  its  associated  demand.  Since  the 
construction  cost  functions  are  often  concave,  reflecting  economies  of  scale,  it  can  become 
attractive  to  use  the  more  expensive  cable  for  future  demand  for  both  cables.  Thus,  careful 
planning  of  the  expansion  policy  is  needed.  A  similar  application  is  the  planning  of  capacity 
expansion  associated  with  communication  facilities  which  serve  digital  and  analog  demands. 
Other  areas  of  applications  include  production  problems  for  two  substitutable  products,  and 
inventory  problems  of  a  single  product  produced  and  consumed  in  two  separate  regions,  see  19] 
for  more  details. 

Many  capacity  expansion  models  and  closely  related  inventory  models  have  been 
developed  for  the  single  facility  problem  with  a  finite  number  of  discrete  time  periods.  The 
first  such  model  was  proposed  by  Wagner  and  Whitin  [13]  who  examined  a  dynamic  version  of 
the  economic  lot  size  model.  Many  authors  extended  this  mode);  for  example,  Manne  and 
Veinoti  [11],  Zangwill  [16]  and  Love  [8],  Zangwill  used  a  network  flow  approach,  and  Love 
generalized  the  model  to  piecewise  concave  cost  functions  and  bounded  idle  capacities  and 
shortages. 

Several  models  and  algorithms  for  two  facility  problems  have  been  developed.  Manne 
[10],  Erlenkotter  [1,2],  Kalotay  [5],  and  Fong  and  Rao  [3]  examined  models  in  which  it  is 
assumed  that  converted  capacity  is  reconverted  automatically,  at  no  cost,  at  the  end  of  each 
period.  Kalotay  [6],  Wilson  and  Kalotay  [14],  Merhaut  [12],  and  Luss  [9]  examined  models  in 
which  converted  capacity  is  not  reconverted  automatically  at  the  end  of  each  period. 

In  Section  1  we  describe  the  generalized  model.  The  algorithm  in  [9]  is  extended  and 
used  to  solve  the  new  model  with  the  additional  features  described  before.  In  Section  2  a  shor¬ 
test  path  formulation  is  presented,  and  in  Section  3  some  properties  of  an  optimal  solution  are 
identified.  These  properties  are  used  to  compute  the  costs  associated  with  the  links  of  the  net¬ 
work  constructed  for  the  shortest  path  problem.  In  Section  4  the  solution  is  illustrated  by  a 
numerical  example,  and  some  final  comments  are  given  in  Section  5. 

1.  THE  MODEL 

The  model  assumes  a  finite  number  of  discrete  time  periods  in  which  the  demand  incre¬ 
ments,  new  constructions,  capacity  disposals,  and  capacity  conversions  occur  instantaneously 
and  simultaneously  immediately  after  the  beginning  of  each  period.  We  define  the  following 
notation: 

i  —  index  for  the  two  facilities. 

/  —  index  for  time  periods  (r  =  1,2 . T)  where  Tis  the  planning  horizon. 

r,,  —  the  increment  of  demand  for  additional  capacity  of  facility  i  incurred 

immediately  after  the  beginning  of  period  i.  The  r„'s  may  be  negative,  and 
for  convenience  are  assumed  to  be  integers. 

’i 

i.'j)  -  £  r"’  f°r  ^  h 


x, 


—  the  amount  of  new  construction  ( x„  >  0),  or  capacity  disposal  ( x„  <  0), 
associated  with  facility  i  immediately  after  the  beginning  of  period  t. 
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v,  —  the  amount  of  capacity  converted  immediately  after  the  beginning  of  period 

t.  y,  >  0  (y,  <  0)  implies  that  capacity  associated  with  facility  I  (facilit>  2)  is 
converted  to  satisfy  the  demand  of  the  other  facility.  Once  converted,  the 
capacity  becomes  an  integral  part  of  the  new  facility. 

/„  —  the  amount  of  idle  capacity  </„  >  0).  or  capacity  shortage  </„  <  0).  associ¬ 

ated  with  facility  /  at  the  beginning  of  period  /  (or  equivalently,  at  the  end  of 
period  /  —  1,  f  =  2,3,  ,  T  +  1).  We  assume  that  initially  there  is  no  idle 

capacity  or  capacity  shortage,  that  is,  /, ,  =  0. 

I„  —  lower  bound  on  /„,  that  is,  the  maximum  capacity  shortage  of  facility  / 

allowed  at  the  beginning  of  period  / ;  the  /„' s  are  assumed  to  be  integers  and 
-  °°  ^  0. 

—  upper  bound  on  the  idle  capacity  of  facility  /  at  the  beginning  of  period  i. 
The  w„'s  are  assumed  to  be  integers  and  0  ^  w„  <  °o. 

c„(x„)  —  the  construction  and  disposal  cost  function  for  facility  /  at  time  period  /. 

g^y,)  —  the  conversion  cost  function  at  time  period  /. 

/»„(/, ,l+1)-  the  cost  function  associated  with  idle  capacity,  or  capacity  shortage,  of  facil¬ 
ity  /carried  from  period  /  to  period  1  +  1. 

All  cost  functions  are  assumed  to  be  concave  from  0  to  °°  and  from  0  to  but  not 
necessarily  concave  over  the  entire  interval  [— <»,  °°].  Such  functions  are  called  piecewise  con¬ 
cave  functions,  see  Zangwill  {151.  All  cost  functions  are  also  assumed  to  be  nondecreasing 
away  from  zero;  for  example,  c„(x„)  is  nondecreasing  with  x„  for  x„  ^  0,  and  nondecreasing 
with  -x„  for  x„  <  0.  For  convenience,  we  assume  that  c„( 0)  =  g,( 0)  =  h„(0)  -  0. 

The  problem  can  be  formulated  as  follows: 

f  r  |  2  ) 

(1.1)  Minimize  £  £  c,M„)  +  M4,<+t)  +  &0v) 

xir  vi  /-I  [  /«] 

(1.2)  /|.,+|  =  / 1,  +  x,,  -  y,  -  ru 

(1.3)  l2.,+ 1  -  h,  +  x2l  +  y,  -  r2, 

(1) 

(14)  /„  <  /,,  <  w„ 

(1.5)  /,,  -  0 

(1.6)  /,r+i-0 

The  objective  (1.1)  is  to  minimize  the  total  cost  incurred  over  all  periods.  Equations 
(1.2)  -  (1.3)  express  the  idle  capacity  or  capacity  shortage  1  as  a  function  of  /„,  the  actions 

undertaken  at  period  /,  x„  and  y,,  and  the  demand  increments  r„.  Constraints  (1.4)  specify  the 
bounds  on  idle  capacities  and  capacity  shortages,  and  Equation  (1.5)  is  introduced  by  assump¬ 
tion.  Constraint  (1.6)  /,  r+|  —  0  implies  that  idle  capacity  or  capacity  shortages  are  not  allowed 
after  period  T.  Such  a  constraint  is  not  restrictive  since  one  can  add  to  problem  (Da  fictitious 
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period  7”  =  7"  +  1  with  r,r  =  max  7?,(l,r)  —  R,(,\,T)  (yielding  7?, (1,7”)  >  71,(1,/)  V/), 

I 

!,r  —  0 ,w,r  —  °°,  and  c,r{  )  —  (•)  —  #r()  =  0.  (I,r  is  fixed  at  zero  since  no  shortages  are 

allowed  at  the  end  of  period  T).  This  allows  us  to  fix  /,  r*i  at  zero  since  then  there  always 
exists  an  optimal  solution  with  l,  rn  =  0-  To  simplify  notation,  we  assume  that  period  7~  in 
formulation  (1)  is  the  added  fictitious  period. 

The  constraints  (1.2)  -  (1.6)  form  a  nonempty  convex  set.  Since  each  term  of  the  objec¬ 
tive  function  is  nondecreasing  away  from  zero  with  a  finite  value  at  zero,  there  exists  a  finite 
optimal  solution.  Furthermore,  suppose  each  of  the  variables  x„ ,  yn  and  /„  is  replaced  in  for¬ 
mulation  (1)  by  the  difference  of  two  nonnegative  variables,  for  example,  x„  =  x'„  -  at where 
x'„  >  0  represents  constructions  and  x,"  >  0  stands  for  disposals.  In  that  case,  the  objective 
function  becomes  concave  on  the  entire  feasible  region;  hence,  there  exists  an  extreme  point 
optimal  solution.  From  Pages  124-127  in  Hu  [4],  the  constraints  (1.2)  -  (1.3)  are  totally  uni- 
modular.  Thus,  since  r„ ,  /„  and  w„  are  assumed  to  be  integers,  such  an  extreme  point  solution 
consists  of  integers.  In  the  next  sections  we  describe  an  algorithm  which  finds  an  optimal 
extreme  point  solution. 

2.  A  SHORTEST  PATH  FORMULATION 

Since  all  cost  functions  are  nondecreasing  away  from  zero,  it  can  be  shown  that  there 
exists  an  optimal  solution  in  which 

(2)  |/„|  <  max  l/? i (x | , 7”)  +  R2(t2.T)]  S  b  Vi.  t. 

r,.r. 

However,  usually,  better  bounds  than  those  given  by  (2)  can  be  assigned.  To  simplify  the 
presentation,  we  assume  that  the  lower  and  upper  bounds  on  the  /„  variables  satisfy  w„  ^  b 
and  /„  >  -b  for  all  values  of  /  and  t. 

Generalizing  the  concept  of  capacity  point  in  (9J,  we  define  a  capacity  point  as  a  period  t  in 
which  /„  -  0,  or  /„,  or  w„  for  at  least  one  value  of  i.  Since  an  extreme  point  optimal  solution 
consists  of  integers,  the  set  of  capacity  points  is  defined  as  follows; 

(3.1)  /„  =  /2I  =  0 

(3.2)  / 1,  =  lu.0,wu  and  l2,  =  /2,. 0. w2, 

(3) 

(3.3)  / 1,  =  /(,,  0,  mzj,  and  l2l  =  /2,  +  l . 

(3.4)  / 2,  =  l2,,0.w2l  and  / 1,  -  l\,  +  1 . 

/  =  2,3, 

(3.5)  / i.r+ 1  =  ^2.  r+ 1  =  0- 

The  capacity  point  values  can  be  conveniently  specified  by  a  single  pa  ameter  For 
example,  a,  —  1,2,  ....  9  can  be  used  to  specify  the  combinations  given  >y  (3.2).  etc.  A 
complete  example  of  a  special  case  can  be  found  in  (9). 

The  set  of  capacity  points  can  be  limited  to  those  satisfying 

(4)  (4.1)  /,,  +  l2,  ^  R,(r,n  +  R2(t,T) 

(4.2)  ltl  +  /2(  >  -max  [/? t (r , —  1)  +  R2(t2,i-  1)1. 

'  i 


1.1.  . 
-1.1.  . 
-  ,  T 


•  w2.t- 1 
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Equation  (4.1)  stales  that  the  total  idle  capacity  at  the  beginning  of  period  t  does  not  exceed  the 
cumulative  demand  from  period  r  to  T.  Equation  (4.2)  restricts  the  maximum  capacity  shor¬ 
tages  to  the  maximum  demand  increase  from  any  period  prior  to  i  -  1  up  to  period  /  -  1 
Clearly,  there  exists  an  optimal  solution  which  satisfies  (4). 


We  now  describe  a  shortest  path  formulation  which  can  be  used  to  solve  Problem  ( 1  > 
Let 

c4v(«„.«vt|)  —  the  minimal  cost  during  periods  u,  u  +  1 . v  associated  with  an 

extreme  point  solution  of  (1)  when  u  and  v  +  1  are  two  successive  capacity 
points  with  values  defined  by  «„  and  «v+t  More  specifically: 


(5)  </„v(au 

,t*v+ 1>  =  minimum 

V 

I 

2 

^  II  (*11  ^  hit  d 1. 1  +  1  ^ 

+  ft  0/ ' 

such  that 

',r‘i 

/  —  14 

(i)  Constraints  (1.2)  and  (1.3)  are  satisfied  for  t  =  u.  u  +  1 . v, 

(ii)  /„  <  /„  <  w;,  and  /„  0  for  /  =  1,2  and  t  —  u  +  1.  //  +  2 . v. 

(iii)  / 1„  and  l2u  are  defined  by  and  /i.v+i  and  l2  \  are  defined  by  <*N,  |, 

(iv)  x„  and  y,  for  t  =  u,  u  +  1 . v  satisfy  the  necessary  conditions  (to  be  developed 

later)  for  an  extreme  point  solution  of  (1). 


Suppose  that  all  subproblem  values  «/„„(<*„. «V4 i>  are  known.  The  optimal  solution  can  then 
be  found  by  searching  for  the  optimal  sequence  of  capacity  points  and  their  associated  values. 
As  shown  in  Figure  1,  Problem  1  can  be  formulated  as  a  shortest  path  problem  for  an  acyclic 
network  in  which  the  nodes  represent  all  possible  values  of  capacity  points.  Each  node  is 
described  by  two  values  (/,<*,)  where  /  is  the  time  period  and  «,  is  the  associated  capacity  point 
value.  From  each  node  (//.«„)  there  emanates  a  directed  link  to  any  node  (v  +  l.«W|)  tor 
v  >  u  with  an  associated  cost  of  </mv(«I).hV4I). 


Let  C,  be  the  number  of  capacity  point  values  at  period  /.  Clearly,  C,  =  C/.i  =  1,  and  C, 
for  all  other  periods  can  be  obtained  from  Equations  (3)  and  (4).  The  total  number  of  links  A 
in  the  shortest  path  problem  is 

i  I  /  + 1  ) 


(6) 


C, 


I  C, 


/-l  1/-/M 


Since  most  of  the  computational  effort  is  spent  on  computing  the  </,M  (<»'„,«,,  | )  values,  it  is 
important  to  reduce  A,  if  possible.  One  way,  of  course,  is  to  reduce  the  values  o!  (  through 
the  imposition  of  appropriate  bounds  /„  and  h„ 


The  shortest  path  problem  can  be  solved  using  various  algorithms  Since  the  network  is 
acyclic  a  simple  dynamic  programming  formulation  can  be  used  l  et  <«.  be  described  bv  the  set 

of  integers  1.2 . C’M  where  «,  =  1  represents  /l(  =  l2,  =  0.  Furthermore,  let  ./,(<«  ,)  be  the 

cost  of  an  optimal  policy  over  periods  i,  i  +  1 .  T,  given  that  period  i  is  a  capacity  point. 

and  that  /,,  and  /2,  are  specified  by  The  following  dynamic  programming  formulation  is 
then  obtained: 

fr*  i  (a  r*  i  f 

(7)  /.(a,) 


u  -  1.2 . 


=  0,  « 7  +  1  —  1 

=  min  |<yi(V(«„.«v+|)  +  /v.|(nv.i>l. 

14  ^  / 

U.»  ..M 


»  I  V  •  1 


u  =  T.T-  1 . 


a 
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Figure  1  The  shortest  path  formulation 


The  first  term  of  the  minimand  is  the  minimum  cost  of  the  optimal  policy  during  periods  u , 
u  +  1,  ....  v,  given  that  u  and  v  +  1  are  two  successive  capacity  points  with  values  au  and 
«v+t-  The  second  term  is  the  optimal  cost  for  periods  v  +1,  v  +  2,  ....  T,  given  ov+t. 

3.  SOLUTION  OF  THE  SUBPROBLEMS  <fuv(a„,a,+1) 

Most  of  the  computational  effort  is  spent  on  computing  the  subproblem  values.  As  shown 
in  (91,  when  r„  ^  0,  x„  ^  0,  l„  —  0  and  w„  —  °°  for  ail  /  and  f,  the  subproblems  are  solved  in  a 
trivial  manner,  however,  when  the  r„’s  are  allowed  to  be  negative  the  effort  required  to  solve 
the  subproblems  increases  significantly.  The  additional  modifications  needed  to  solve  the  sub¬ 
problems  d„v(a1/.av.H),  as  defined  by  (5)  for  the  generalized  model,  require  a  more  careful 
analysis  than  needed  in  (9),  however,  the  resulting  computational  effort  appears  to  be  about  the 
same. 


To  compute  the  subproblem  values  d„v(au,av+l),  it  is  convenient  to  describe  Problem  (1) 
as  a  single  commodity  network  problem.  The  network,  shown  in  Figure  2,  includes  a  single 
source  (node  0)  with  a  supply  of  Rt(l,T)  +  R2(1,T).  There  are  2T  additional  nodes,  each 
denoted  by  (/,/)  where  /  specifies  the  facility  and  /  specifies  the  time  period.  At  each  node  (/,/) 
there  is  an  external  demand  increment  /■„,  possibly  negative.  The  nodes  are  connected  by  links, 
where  the  flows  along  these  links  represent  the  constructions,  disposals,  conversions,  idle  capa¬ 
cities,  and  capacity  shortages.  The  flows  on  each  link  can  be  in  either  direction,  and  the  link 
direction  in  Figure  2  indicates  positive  flows  The  nodes  are  connected  by  the  following  links: 


A  link  from  node  0  to  each  node  (/,/)  with  flow  x„.  x„  is  positive  if  the  flow  is  from 
node  0  to  node  (/./),  and  negative  otherwise. 


(M*\(  in  i -\i>  wmos  wu n  i  wo  i  \c  ii  in  n  im  s 


60.1 
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—  A  link  from  each  node  </,/)  to  node  (i.t  +  1)  with  flow  /,,(|  is  positive  if  the 

flow  is  from  (i.t)  to  ( i.t  +  1)  and  negative  otherwise. 

—  A  link  from  each  node  (l.r)  to  node  (2 ,t)  with  (low  v,.  y,  is  positive  if  the  How  is 
from  node  (1,/)  to  (2,r),  and  negative  otherwise. 

As  discussed  before,  we  are  interested  in  finding  an  optimal  extreme  point  solution  to  a 
modified  version  of  Problem  (1),  in  which  each  of  the  variables  ,v„.  y,.  /„  is  replaced  by  the 
difference  of  two  nonnegative  variables.  It  can  be  shown  that  a  feasible  How  in  the  network 
given  in  Figure  2  corresponds  to  an  extreme  point  solution  of  Problem  ( 1 )  modified  as 
described  above,  if  and  only  if  it  does  not  contain  any  loop  with  non/ero  flows  in  which  all  /„ 
flows  satisfy  l„  <  /„  <  w„  and  /„  ^  0. 

Concentrating  upon  a  single  subproblem,  as  shown  in  Figure  3,  one  may  observe  that  a 
feasible  flow  does  not  contain  such  loops  if  and  only  if  the  following  properties  are  satisfied: 

(8.1)  x„t  Jt„,  —  0  (f|  ^  t2),  i—  1,2 

(8)  (8.2)  }) ^'i.  ”  0  (t|  ^  I2).  u  ^  t\,  1 2,  1 3  ^  v 

(8.3)  A-|f|  “  0. 

For  example,  suppose  (8.3)  is  violated  and  t\  ^  n  ^  fj,  then  X|,  ,  /)  (|4 , . /Ux.  y,r  l;,r 

1 .  / 2,f ,+ 1  *  x2,  form  a  loop  with  nonzero  flows  and  all  relevant  /„  values  satisfy 

/„  <  /„  <  w„  and  /„  ^  0. 

Pquation  (8)  implies  that  in  the  optimal  solution  of  <4v(aru.<*v,  |)  there  is  at  most  one  new 
construction  or  disposal  for  each  facility  (8.1),  and  at  most  one  conversion  (8.2)  Furthermore, 
il  two  constructions  or  disposals  (one  per  facility)  are  being  considered,  conversion  is  then  not 
allowed  (8.3). 
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Figure  3  A  network  flow  representation  of  a  subproblem 


Let  D,  be  the  capacity  change  of  facility  i  during  periods  u,u  +  1 . v,  that  is: 

(9)  D,  “  />.v+i  +  RAu,  v)  -  /,u,  /  —  1.2 

or,  equivalently 

00)  Di-JjXu-y, 

r-u 

D2-  X  *2/  +^- 

l-u 

Let  fi  and  r2  be  two  time  periods  u  <  Ut.tj)  <  v.  From  the  optimal  properties  (8)  shown 
above,  the  possible  policies  associated  with  an  optimal  solution  to  any  subproblem  duv(au,av *  t) 
can  be  restricted  to  three  different  policies.  These  policies  are  summarized  in  Table  1  below 

To  illustrate  the  table,  let  us  concentrate  on  the  column  D\  <  0  and  D2  ^  0.  Policy  <al 
indicates  a  single  disposal  of  D\  capacity  units  of  facility  1,  and  a  single  construction  of  D 2  units 
of  facility  2.  Policy  (b)  implies  a  single  construction  of  Dt  +  D2  of  facility  1  if  D  +  P2  ^  0,  a 
single  disposal  of  D{  +  D2  of  facility  1  if  D]  +  D2  <  0,  and  a  single  conversion  of  I)2  units 
from  facility  1  to  facility  2.  Obviously,  if  Dt  +•  D2  -  0,  no  constructions  or  disposals  take 
place,  and  if  D2  -  0,  no  capacity  is  converted.  Finally,  policy  <c)  consists  of  a  single  construc¬ 
tion  of  D\  +  D2  capacity  units  of  facility  2  if  D\  +  D2  ^  0,  a  single  disposal  of  D\  +  l)2  units 
of  facility  2  if  Z>,  +  A2  <  0,  and  a  single  conversion  of  -D\  from  facility  l  to  facility  2 

The  optimal  solution  of  a  subproblem  duJ a„,av<.j)  is  therefore  obtained  by  the  following 
procedure: 


For  each  of  the  policies  (a),  (b),  and  (c)  in  Table  1,  find  the  optimal  values  of  r,  and 
t2,  which  minimize  duv(a u.av+\)  as  given  by  Equation  (5),  while  satisfying  condi¬ 
tions  (i)  -  (iv)  given  below  Equation  (5).  If  no  feasible  values  of  f)  and  t2  exist,  set 
the  value  of  the  corresponding  policy  to  «>. 


(1) 


<  \i*u  iiy  i  xp.ansion  wiiii  rwo  i  a<  ii  in  i  vim  s 
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TABLE  1.  Possible  Policies  for  Optimal  Subproblem  Solutions 


Policy  - — 

/>,  >  0 

D:  >  0 

D ,  <£  0 

D,  ^  0 

/2,  Js  0 

I),  0 

/>,  ^  0 
l);  $  0 

a  1  / ,  =  0|.  =  0  1  *  rt 

construction 

disposal 

construction 

disposal 

(a)  x2,}  =  D},  x2,  —  0  /  *  f2 

y,  -  0  Vr 

construction 

construction 

disposal 

disposal 

*i,,  -  D|  +  f>2.  -V|,  -  0  /  *  r, 

construction 

construction 

or  disposal 

construction 

or  disposal 

disposal 

(b)  =  D2.  y,  =  0  1  ^  t2 

x2,  -  0  V/ 

conversion 

from  1  to  2 

conversion 

from  1  to  2 

conversion 

from  2  to  1 

conversion 

from  2  to  1 

*2,,  -  D 1  +  D2.  Xj,  -  0  t  *  r, 

construction 

construction 

or  disposal 

construction 

or  disposal 

disposal 

(c)  y, 2 - /),.  y,  =  0  t  g  1, 

x„  -  0  V/ 

conversion 
from  2  to  1 

conversion 
from  1  to  2 

conversion 
from  2  to  1 

conversion 

from  1  to  2 

(2)  Choose  as  the  optimal  policy  the  best  of  those  found  in  Step  (I).  If  none  of  the  policies 

is  feasible,  tfuv(au,avf ))  •= 

The  procedure  above  may  involve  spending  a  significant  amount  of  computation  on 
finding  all  feasible  policies  and  comparing  the  costs  associated  with  these  policies. 

4.  A  NUMERICAL  EXAMPLE 

As  an  illustration,  we  solve  the  capacity  expansion  problem  shown  in  Figure  4 
/ 14  —  /M  =■  0  by  assumption,  thus,  a  fictitious  period  is  not  added  The  cost  functions  are  given 
in  Table  2  below. 

The  shortest  path  formulation  is  shown  in  Figure  5.  The  capacity  point  values  are  given 
inside  the  nodes  in  terms  of  / , ,  and  l2,  rather  than  a,.  Using  liquation  (4  1).  several  capacity 
point  values  are  omitted  in  periods  2  and  3.  Furthermore,  all  links  from  period  t  =*  1  to 
periods  t  —  3  and  4  are  omitted  since  (here  is  no  feasible  solution  to  the  associated  subprob¬ 
lems  with  / [ 2  <  I u  <  H’u  and  /)2  0.  The  number  associated  with  each  link  is  the  optimal 

solution  of  the  corresponding  subproblem  The  shortest  path  is  marked  by  stars 

Consider  the  subproblem  </i i <<* j .  «* 2 >  where  <»t  represents  the  capacity  point  value 
/n  *  h\  ”  0,  and  u2  represents  /:i  -  /;:  =  0  By  liquation  (9).  /J,  =  1  and  l):  =  1.  Using 
the  results  of  Table  1.  policy  (a)  yields  a,,  =  .\s,  =  I  with  a  total  cost  of  68.  policy  (bl  yields 
Xu  =  2  and  y,  -  1  with  a  cost  of  46,  and  policy  (c)  yields  v>,  =  2  and  \  \  —  -  1  with  a  cost  ol 
45  Hence  policy  (c)  is  the  optimal  one 

To  illustrate  further,  consider  d}^n .  where  «_>  stands  for  /,■  -  1  and  -  0.  and 

04  stands  for  / ,4  —  /?4  -  0,  so  that  /3,  -  I  and  D;  —  0  From  Table  I.  policy  (a)  implies  that 
either  xl}  —  1  or  v„  -  I  However,  if  x, ,  —  1  (and  xl?  -  0)  then  f ,  -  0  so  that  d;A  I  -  r» 
Hence,  policy  (a)  implies  a,;  —  1  with  construction  and  holding  cost  of  43  2  Policy  < b )  yields 
the  same  solution  as  policy  (a),  and  policy  < c )  results  in  a.;  -  I  and  v:  -  I  with  a  total  cost 
of  40  5.  hence,  policy  (c >  is  optimal  for  that  subproblcm 
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Figure  S  The  shortest  path  problem  lor  ihe  example 

Finally,  consider  d1}(a2.a4)  wiih  a2  standing  for  /,2  «  /?2  0,  and  a*  standing  for 

^14  ”  h*  *  0.  From  Table  1,  since  D |  —  02  —  0,  all  decision  variables  are  zero  in  all  three  pol¬ 
icies  and  the  total  costs  incurred  are  equal  to  9. 

After  solving  the  subproblems  for  all  the  links  of  Figure  5,  the  shortest  path  can  be  found 
using  the  dynamic  programming  formulation  (7)  or  any  other  shortest  path  algorithm  The 
shortest  path  in  this  example  is  54  and  consists  of  two  links.  The  first  link  connects  node 
!\\  “  h\  ”  0  to  node  /)2  *=  /22  *■  0,  and  the  second  link  connects  node  / 12  -  l:2  =  0  to  node 
“  724-  0.  The  optimal  policy  of  the  entire  problem  is  x2!  -  2,  v,  =  -1,  with  all  other 
decision  variables  x„  and  y,  being  equal  to  zero 

5.  FINAL  COMMENTS 

This  paper  generalizes  our  previous  work  (9)  by  allowing  for  capacity  disposals  and  capa¬ 
city  shortages.  Furthermore,  bounds  on  idle  capacities  and  capacity  shortages  can  be  imposed 
The  model  is  formulated  as  a  shortest  path  problem  in  which  most  of  the  computational  effort 
is  spent  on  computing  the  link  costs  Using  a  network  flow  approach,  properties  of  extreme 
point  solutions  are  identified  These  properties  are  used  to  develop  an  efficient  search  for  the 
link  costs. 


Further  generalizations  mav  include  bounds  on  new  constructions  and  capacity  disposals, 
and  operating  costs  which  depend  on  the  facility  type  and  time  period  As  shown  In  several 
authors,  for  example  Lambrecht  and  Vander  F.ecken  I7|,  bounded  constructions  or  disposals 
complicate  considerably  even  the  single  facility  problem  Introducing  operating  costs  may 
require  major  changes  in  the  algorithm  since  the  amount  of  each  capacity  type  used  io  satisfy 
the  demand  in  each  period  affects  the  total  cost 
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Finally,  negative  costs  for  disposals  (credit  for  salvage  value)  can  be  incorporated  for  cer¬ 
tain  cost  functions  c„(x„)  for  which  the  optimal  solution  would  be  finite.  For  example,  cost 
functions  in  which  the  credit  per  unit  of  disposed  capacity  is  always  smaller  than  the  construc¬ 
tion  cost  per  unit  of  capacity.  In  general,  however,  cost  functions  c„(v„)  that  are  negative  for 
v„  <  0  may  result  in  an  unbounded  solution. 
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ABSTRACT 

This  paper  considers  the  problem  of  locating  multiple  new  facilities  in  order 
to  minimize  a  total  cost  function  consisting  of  the  sum  of  weighted  Euclidean 
distances  among  the  new  facilities  and  between  the  new  and  existing  facilities, 
the  locations  of  which  are  known.  A  new  procedure  is  derived  from  a  set  of 
results  pertaining  to  necessary  conditions  for  a  minimum  of  the  objective  func¬ 
tion  The  results  from  a  number  of  sample  problems  which  have  been  exe¬ 
cuted  on  a  programmed  version  of  this  algorithm  are  used  to  illustrate  the 
effectiveness  of  the  new  technique 


1.  BACKGROUND 

It  was  as  early  as  the  17th  century  that  mathematicians,  notably  Fermat,  were  concerned 
with  what  are  now  known  as  single  facility  location  problems.  However,  it  was  not  until  the 
20th  century  that  normative  approaches  to  solving  symbolic  models  of  these  and  related  prob¬ 
lems  were  addressed  in  the  literature.  Each  of  these  solution  techniques  concerned  themselves 
with  determining  the  location  of  a  new  facility,  or  new  facilities,  with  respect  to  the  location  of 
existing  facilities  so  as  to  minimize  a  cost  function  based  on  a  weighted  interfacility  distance 
measure. 

If  one  studies  a  list  of  references  to  the  work  done  in  the  past  decade  involving  facility 
location  problems  it  becomes  readily  apparent  that  there  exists  a  strong  interdisciplinary  interest 
in  this  area  within  the  fields  of  operations  research,  management  science,  logistics,  economics, 
urban  planning  and  engineering.  As  a  result,  the  term  "facility"  has  taken  on  a  very  broad  con¬ 
notation  in  order  to  suit  applications  in  each  of  these  areas.  Francis  and  Goldstein  |4)  provide  a 
fairly  recent  bibliography  of  the  facility  location  literature.  One  of  the  most  complete 
classifications  of  these  problems  is  provided  in  a  book  by  Francis  and  White  |5] 


‘This  work  was  supported  by  the  National  Research  Council  of  Canada  under  Grant  A44I4  and  by  an  Ontario  Oradu 
ate  Scholarship  awarded  to  Paul  Calamai 
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This  paper  concerns  itself  with  the  development  of  an  algorithm  for  solving  one  particular 
problem  in  the  area  of  facility  location  research.  The  problem  involves  multiple  new  facilities 
whose  locations,  the  decision  variables,  are  points  in  E2  space.  The  quantitative  objective  is  to 
minimize  the  total  cost  function  consisting  of  the  sum  of  weighted  Euclidean  distances  among 
new  facilities  and  between  new  and  existing  facilities.  The  weights  are  the  constants  of  propor¬ 
tionality  relating  the  distance  travelled  to  the  costs  incurred.  It  is  assumed  that  the  problem  is 
"well  structured"  [3], 

The  Euclidean  distance  problem  for  the  case  of  single  new  facilities  was  addressed  by 
Weiszfeld  [13],  Miehle  [10],  Kuhn  and  Kuenne  [8],  and  Cooper  [1]  to  name  a  few.  However, 
it  was  not  until  the  work  of  Kuhn  [7]  that  the  problem  was  considered  completely  solved.  A 
computational  procedure  for  minimizing  the  Euclidean  multifacility  problem  was  presented  by 
Vergin  and  Rogers  [12]  in  1967;  however,  their  techniques  sometimes  give  suboptimum  solu¬ 
tions.  Two  years  later.  Love  [9]  gave  a  scheme  for  solving  this  problem  which  makes  use  of 
convex  programming  and  penalty  function  techniques.  One  advantage  to  this  approach  is  that  it 
considers  the  existence  of  various  types  of  spatial  constraints.  In  1973  Eyster,  White  and 
Wierwille  [2]  presented  the  hyperboloid  approximation  procedure  (HAP)  for  both  rectilinear 
and  Euclidean  distance  measures  which  extended  the  technique  employed  in  solving  the  single 
facility  problem  to  the  multifacility  case.  This  paper  presents  a  new  technique  for  solving  con¬ 
tinuous  unconstrained  multifacility  location  problems  involving  Euclidean  distances. 

2.  PROBLEM  FORMULATION 

The  continuous  unconstrained  multifacility  location  problem  involving  the  lp  distance 
measure  can  be  stated  as  follows: 

Find  the  point  X*T  =  (X*[,  ....  X*f)in  E2„  to 
(PI)  minimize  f(X)  -  £  vjk \\X2  -  A"* 1 wJ(\\Xj  -  Aj\\p 

1  <  j  <  k  <  n  y—  I  /—  1 

where 

n  A  number  of  new  facilities  (NF’s). 

m  A  number  of  existing  facilities  (EF s). 

A'/—  Otji  xj2)  A  vector  location  of  NFj  in  E2,  j  -  1 . n. 

AJ  -  (an  ai2)  A  vector  location  of  EF,  in  £2,  i  —  1 . m. 

\jlc  A  nonnegative  constant  of  proportionality  relating  the  lp  distance  between  NFj  and  NFk 
”  to  the  cost  incurred  I  <  j  <  k  <  n. 

h 'j,  A  nonnegative  constant  of  proportionality  relating  the  lp  distance  between  NF,  and  EF,  to 
the  cost  incurred  1  <  j  ^  w,  1  ^  i  ^  m. 

II Xj  -  Xk\\„  -  (Uyi  -  x*||'+  \xj2  -  xJ^'A  lp  distance  between  NFt  and  NFk. 

1 1 Xj  -  At\\p  “  llxyi  -  a, il'  +  |xy2  -  aaVY1"  A  lp  distance  between  NFj  and  EF, 

Note  that  we  make  the  assumption  that  vJk  -  v*,  for  j.k  —  1 . n.  Substituting  p  -  1 

and  p  •  2  in  Problem  PI  respectively  yields  the  rectilinear  distance  problem  and  the  Euclidean 
distance  problem. 
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For  the  purpose  of  this  paper  Euclidean  distance  will  be  the  measure  used  between  facili¬ 
ties  located  as  points  in  E2  space.  The  objective  function  becomes 

minimize  f(X)  -  £  vjk  {( xjX  -  xkX)2  +  ( xJ2  -  xk2)2)l/2 

X  l<><<t<n 

0*2)  +IX  wj>  l^i  “  an)2  +  (->02  ~ 

j- i  <-i 

The  techniques  presented  in  this  paper  can  also  be  used  for  problems  involving  facilities  located 
in  three-dimensional  space. 

3.  NEW  FACILITY  CATEGORIZATION 

If  we  consider  a  current  solution  to  Problem  P2  we  can  think  of  each  new  facility  as  being 
in  one  of  the  following  distinct  categories: 

(1)  Unique  Point  (UP) 

A  new  facility  in  this  category  occupies  a  location  that  differs  from  all  other  facility 
locations. 

(2)  Coinciding  Point  (CP) 

A  new  facility  in  this  category  occupies  a  location  that  coincides  with  the  location  of 
an  existing  facility  but  differs  from  the  current  locations  of  all  other  new  facilities. 
Thus,  each  new  facility  in  this  category  has  associated  with  it  some  existing  facility 
which  has  the  same  vector  location. 

(3)  Unique  Clusters  (t/C],  ....  UC^uc ) 

All  new  facilities  in  the  fcth  unique  cluster  (k  =  1,  ....  NUC)  occupy  the  same  vec¬ 
tor  location.  This  location  is  distinct  from  all  existing  facility  locations  as  well  as  the 
current  locations  of  new  facilities  that  are  not  classified  in  this  cluster 

(4)  Coinciding  Clusters  (CC|,  ....  CCWr) 

All  new  facilities  categorized  in  the  kth  coinciding  cluster  (k  -  1,  .  .  ACC)  occupy 

the  same  vector  location.  This  location  coincides  with  the  location  of  some  existing 
facility  and  differs  from  the  current  locations  of  all  new  facilities  that  are  not 
classified  in  this  cluster.  Each  of  these  coinciding  clusters  of  new  1  acilities  is  there¬ 
fore  associated  with  some  existing  facility  with  which  it  shares  a  location 

If  we  define  the  index  sets  /  A  jl . n!  and  /  A  jl . m )  and  let  the  subsets 

=  CC0  -=  <t>  then  the  categorization  can  be  restated  as  follows 

Partition  the  set  J  into  the  subsets  UP,  CP,  UC\ . UCsix  -  (  (.'i,  ,  (.'(  *<  <  where 

(3.1)  UP  =  (V,  r  J\A,  *  Xj*  Xk\ V/  €  /,  Vk  €  J  -  {./M 

(3.2)  CP  =  lv,  €  J\A,  -  Xj  *  Xk,  ijt  1,  V*  €  J  -  01! 

for  nr  «=  1 . NUC 

UC„  A  V ,  €  J  -  u'  UC,\A ,  *  X.  -  Xk ;  V /  €  /.  k  €./  -  |./)  -  V  UC, 

*  I  /- 0  /-n 


(3.3) 
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for  0  -  1 . NCC 

(3.4)  CC0  A  |vy  €  J  -  V  CC,\Alf  -  Xj  -  Xk\  ip  €  /,  k  €  /  -  0)  -  V  CC, 

/Vi/C  A  number  of  unique  clusters. 

NCC  A  number  of  coinciding  clusters. 

Note  that 

(a)  New  facility  j  coincides  with  existing  facility  ij  for  j  £  CP  (from  3.2). 

(b)  The  new  facilities  in  cluster  /9  coincide  with  existing  facility  ip  for  /3  —  1 ,  . . .  ,  NCC 
(from  3.4). 

In  order  to  use  this  notation  for  the  derivation  of  the  new  algorithm  given  in  the  next  sec¬ 
tion  define  a  unit  vector  D  in  £2n  as  follows: 


or=u>r. ... 

.  AT) 

where 

(3.5) 

Df~  Idj,  dj2). 

j  =  1. .. 

.  ,  n 

and 

IId||2=  l. 

(4.2b) 
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(4.2c)  Gj  =  £ 

(4.2d)  G,  =  £ 


-  JfA.)  _  Wj,(Xj  -  -4/) 
llA'-Jf*ll2  WXj-A,\\2 

vJk(Xj  -  Xk)  —  Wj,(Xj  -  A,) 
WXj-x.W,  fe\\Xj- a,\\2 


V  ,  6  UCa 
a  =  1 . NUC 

Vj  €  CCfi 
P=  1 . NCC 


It  should  be  noted  that  in  each  case,  the  expression  for  G,  is  the  gradient  of  that  part  of 
the  objective  function  /(JO,  which  is  differentiable  with  respect  to  X,.  In  the  case  where 
j  €  UP ,  the  expression  is  the  exact  gradient  with  respect  to  X,:  in  all  other  cases,  the  exprev 
sion  for  G,  can  be  considered  a  pseudo-gradient  of /(JO  with  respect  to  A',. 

Since  /(JO  is  a  convex  function,  the  point  X*  in  E2„  is  a  minimum  for  this  I  unction  ,f 
and  only  if  the  directional  derivative  dDf{ X*)  is  nonnegative  for  all  unit  vectors  D  in  L-ln.  This 
fact  will  be  used  in  the  next  section. 

5.  NECESSARY  CONDITIONS  FOR  OPTIMALITY 

THEOREM  1:  If  the  following  conditions  are  not  satisfied  at  tfu  point  A  in  t  then  the 
directional  derivative  given  by  expression  (4.1)  will  be  negative  for  some  un..  *ecu>.  />  in  L.., 

(5.D  (i)  i|cyl l2  =  0  \/j  e  up 

(5.2)  (2)  IIgJIj  <  Wj,'  V,  €  CP 

(5.3)  (3)  fora  =  1 . NUC 

IIIg,||2<E  I  VS  cue,, 

jiS  j(S  kHUCa-Si 

(5.4)  (4)  for/3  -  1,  ...  .  NCC 


HE  Gj n2<  e 

E  vJk 

+  *% 

JiT  jiT 

kdCCy-  T] 

PROOF:  The  proofs  for  conditions  1  and  2  are  obvious. 

]  =  R  for  j  €  5 

For  3)  setj  Dj  .  0  for  j  q  s 

then  dDf( X)  -  £  Gy  ■  R  +  £  £  vjk  1 1 1 1 2 

jiS  jiS  kHUCa-S) 

-  Il/dhlll  E  G,ll2  cos  9  +  X  E  vy*| 

j  jes  jts  k(iuc„-s]  J 

Therefore,  dDf( X)  >  0  VD  only  if 

ME  <  E  E  vjk 

j(S  JiS  *€|t/C„-Sl 

The  proof  for  condition  4  is  similar. 


VS  c  uca. 
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6.  I  PDA  I  E  FORMULAS 

As  a  result  of  the  prececding  optimality  conditions  the  following  update  formulas  are  con¬ 
structed: 

CASE  I:  If  3  j  €  UP  such  that  MG,!!  ^  0  then  the  direction  of  steepest-ascent  in  the 
subspace  defined  by  X,  is  G,  =  Gj.  We  therefore  use  the  following  update  formula  tor  Xr 

X,  ~-Xj  -  A,  G , 

where 

■>  _ 'jk 


(6.1) 


k*j 


+  z 


img --*’*112  fr,\\xJ-Aiw1 


CASE  2:  If  3 \j  €  CP  such  that  l|G;l|2  >  then  the  direction  of  steepest-ascent  in  the 
subspace  defined  by  Xj  is  G7  =  Gr  We  therefore  use  the  following  update  formula  for  Xr 

Xj  -  Xj  -  \j  Gj 

where 


(6.2) 


I 

k*j 


vJk 


+  Z 


,  11*,- **112  ft,  WXj- a, W, 

CASE  3:  If  35  C  t/Ca,  a  -  1 . NUC,  such  that 

IIXg,II2>X  I  vJk 

jtS  JtS  kt\UCn~S I 

then  the  direction  of  steepest-ascent  in  the  subspace  defined  by  the  subset  cluster  is 
Gs  =  £  Gj.  We  therefore  use  the  following  update  formula: 

,iS 

V,  6  S'  Xj  -  Xj  -  As  Gs 


where 
(6  3) 


A,  = 


I 

it  s 


y  :±.  ..  t  y  _J2L..r7 
kftt  []X'  ~  *»!■  ■  I*  -  4.;! 


-i 


CASE  4:  If  3PC  CC>,  0  -  1.  .  ACC  ,  such  that 

MX  G,\\ 

lf  T 

then  the  direction  of  steepest-ascent  in  the  subspace  defined  by  the  subset  cluster  is  Gj  - 
£  G  We  therefore  use  the  following  update  formula: 

r  r 


>  X 

X  VA 

+  w„ 

rlli 

,/€7- 

*f  [rr„-rl 

V  I-  / 


V.  ■■  X,  -  A  g7 
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where 

(6.4) 


X  r  — 


■M 


I 

k9CCa 


Vjk 


fl  Xj-Xk 


*02 


+  1 

/€/ 


\\Xj-  A,\\i 


In  each  result,  the  expression  for  lambda  (X)  can  be  considered  a  weighted  harmonic 
mean  [8]  of  the  interfacility  distance  terms  appearing  in  the  equation  for  the  gradient  (Case  1) 
or  pseudo-gradients  (Cases  2  through  4). 

7.  A  NEW  ALGORITHM 

Using  the  results  derived  in  the  preceeding  section  the  following  algorithm  can  be  used  to 
solve  Problem  P2: 

(1)  Find  a  current  solution  X  in  Eln. 

(2)  Try  to  obtain  a  better  solution  by  moving  single  new  facilities  by  using  Cases  1  and  2 

(3)  For  a  =  1,  ...  .  NUC  try  to  obtain  a  better  solution  by  applying  the  special  form  of 
Case  3  where  |S|  =  1  (to  move  single  new  facilities)  or.  if  this  fails,  applying  the 
special  form  of  Case  3  where  |S|  =  \UC„\  (to  move  entire  clusters  of  new  facilities). 

If  successful,  return  to  Step  2. 

(4)  For  (3  =  1 . NCC  try  to  obtain  a  better  solution  by  applying  the  special  form  of 

Case  4  where  |r|  —  1  (to  move  single  new  facilities)  or,  if  this  fails,  applying  the 
special  form  of  Case  4  where  \  T\  =  | ('('„!  (to  move  enlire  dusters  of  new  facilities) 
If  successful,  return  to  Step  2. 

(5)  Try  to  obtain  a  better  solution  by  moving  subset  clusters  using  Cases  5  and  4  ||  an 

improvement  is  made,  return  to  Step  2. 

8.  REMARKS  ON  IMPLEMENTATION 

The  following  rules  were  used  in  implementing  the  algorithm  described  in  the  Iasi  section 


(a)  New  facility  j  and  new  facility  k  were  considered  "clustered"  if  : 


(8.ia) 

or 

(Rib) 


I  Mr,  I 


I  Xh  1 1 2  <  *  i  \  ^  j  <  k  ^  n 


2  *  I  Mr, 


**ll2 


I  Mr,  I 


<  f, 


«*ll2 


1  <  /  <  k  <  n 


where  e,  A  inputted  cluster  tolerance. 


( b>  New  facility  j  and  existing  facility  /'were  considered  "coinciding"  if 

(8.2a)  1 1  1 1 2  +-  IMJI2  <  «i  2=1 . =  1 . m 

or 


(8  2b) 


2  •  I  Mr,  -  a,\ l, 

IUJIj.-f  full, 


j  —  I . n ,  1  —  1 ,  .  .  m 
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where  <1  A  inputted  cluster  tolerance, 

(c)  The  update  formulas  were  used  only  if: 

(8.3)  \llG||2>e2 

where  «2  A  inputted  step  tolerance.  This  helped  avoid  the  possibility  of  repeatedly  taking  small 
steps.  However,  the  step  tolerance  is  reduced  prior  to  the  termination  of  the  algorithm  as  out¬ 
lined  by  the  next  rule. 

<d)  In  order  to  ensure  optimality,  the  following  check  is  made  prior  to  executing  Step  5  of  the 

algorithm: 

(8.4)  l/U'*-")  -  f(X[hi)\  *  100  ^  *  /(*<*-") 

where  e3  A  inputted  function  tolerance. 

If  this  condition  is  not  satisfied,  the  step  tolerance  (e2)  is  reduced  and  the  algorithm  res¬ 
tarted  at  Step  2 

9.  DISCUSSION 

The  new  algorithm  has  the  following  properties: 

(a)  It  makes  full  use  of  the  structure  of  the  facility  location  problem  thus  avoiding  the 
need  for  any  background  in  related  nonlinear  programming  areas. 

(b)  The  actual  objective  function,  and  not  an  approximation  to  it,  is  minimized  at  each 
step  in  the  algorithm. 

<c)  The  stepsize  used  in  this  algorithm  may  not  be  "optimal”  when  compared  with  step- 
sizes  obtained  from  line-search  techniques.  However,  the  use  of  this  stepsize  has  the 
following  advantages:  a)  ease  of  computation,  b)  maintenance  of  location  problem 
structure,  and  c)  reduced  computation  time  per  update. 

(d)  Although  Step  5  in  the  algorithm  is  combinatorial  in  complexity,  very  little  computa¬ 
tional  work  is  necessary.  This  is  a  result  of  the  fact  that  all  the  information  needed 
for  this  step  has  already  been  computed  and  stored  in  previous  steps 

(e)  The  algorithm  is  similar  to  the  technique  devised  by  Kuhn  for  solving  the  single¬ 
facility  location  problems  with  Fuclidean  distances  17)  and  the  method  devised  by 
Juel  and  I.ove  |6]  for  the  multifacility  location  problem  with  rectilinear  distances 
This  makes  the  algorithm  attractive  to  those  with  experience  with  these  methods 

If)  Currently,  there  is  no  rigorous  proof  that  this  algorithm  converges.  In  1973.  Kuhn 
( 7 1  completed  the  proof  of  convergence  for  a  similar  scheme,  introduced  b\ 
Weiszfeld  ( 1 3 J  in  1937,  for  the  case  of  single  new  facilities.  Based  on  compuialional 
experience  and  on  the  fact  that  the  algorithm  is  designed  to  minimize  the  objective 
function  in  all  new  facility  subspaces,  it  is  likely  that  the  algorithm  always  converges 

(g)  The  main  disadvantage  of  the  algorithm  is  that  the  order  in  which  each  of  the  sub¬ 
spaces  is  checked  is,  currently,  not  optimal  A  method,  based  on  projections,  that 
would  allow  us  to  determine  "a  priori”  which  subspace  to  update,  is  now  being  inves 
tigated. 


1 
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<ht  Most  existing  methods  for  solving  the  multilaciliiv  problem  lack  im  toriMdei.iimn  "! 
the  existence  of  constraints  on  the  solution  space  !lM  1  !us  is  also  true  .il  the  new 
method  outlined  in  this  paper,  however,  the  addition  ot  constiamt>  should  n< >i 
present  a  problem  to  the  projection  technique 

(i)  It  has  vet  to  be  proven  that  the  necessary  conditions  lot  optimahtv  l<  .  Problem  P2 
given  by  Equations  (5.1)  through  (5.4).  are  also  sufficient 

10.  COMPl TATIONAL  EXPERIENC  E 

The  performance  of  the  algorithm  described  in  this  paper  ( MIT  I’V  I)  was  tested  against 
the  hyperboloid  approximation  procedure  (IIAP)  described  in  I  xyster.  White  and  Wierwille  i2! 
and  a  modified  hyperboloid  approximation  procedure  (MhAP)  suggested  bv  Ostresli  ill! 

Two  parameters  were  used  as  a  basis  of  comparison:  I)  the  number  of  new  facility  loca¬ 
tion  updates  needed  to  reach  optimality,  and  2)  the  required  C  PI!  time  in  minutes  In  the  ease 
of  program  MFLPV1,  two  counts  were  considered  necessary  for  specifying  the  first  parameter 
The  first  count  represented  the  number  of  "attempted"  updates  (excluding  those  updates  from 
Step  5  of  the  algorithm).  The  second  couni  represented  the  number  of  "successful"  updates 
The  reason  for  excluding  the  number  of  attempted  updates  from  Step  5  of  the  algorithm  is  sim¬ 
ply  this  computationally,  very  little  work  is  done  at  this  step  in  the  procedure 

Six  problems  were  used  for  the  comparison;  the  first  three  were  taken  from  15 1  (#5  23. 
#5.7  and  #5.6  respectively),  the  fourth  appears  in  12)  and  the  last  two  problems  summarized  m 
Tables  1  and  2,  are  the  authors. 

HAP  and  MHAP  were  both  executed  using  two  different  initial  hyperbolic  constants  t  1 
for  these  problems  in  order  to  emphasize  the  significance  of  this  parameter  to  the  performance 
of  these  algorithms  The  stopping  criteria  used  in  each  case  was  the  same  as  that  outlined  in 
the  paper  introducing  HAP  [2 1  Unless  otherwise  specitied,  program  Mi  l  P V  1  also  made  use  of 
the  following  data 

(1)  «|  A  cluster  tolerance  =  0.01  (from  Equations  (8.1)  and  (8  2)) 

(2)  fi  A  step  tolerance  =  0  05  (from  Equation  (8  3)) 

(3)  ()S  function  tolerance  =  0.01  (from  Equation  (8.4)). 

The  results  of  these  tests  are  summarized  in  Table  3.  The  numbers  in  tins  table  represent 
the  total  new  facility  updates  required  to  reach  optimality.  The  numbers  in  brackets  I  ),  under 
the  column  headed  MFLPV1,  represent  the  number  of  successful  updates  w  hereas  the  unbrack¬ 
eted  numbers  in  these  columns  represent  the  number  of  attempted  updates  I  he  following 
observations  and  comments  can  be  made  about  the  results  summarized  in  this  table 

(a)  In  all  but  Problem  5,  the  number  of  attempted  updates  required  to  reach  optimality 
using  MFLPVI  is  less  than  the  number  of  updates  required  by  HAP  and  MHAP 
These  numbers  are  directly  comparable 

(b)  The  new  procedure  (MFLPVI)  used  considerably  less  CPU  time  in  solving  the  six 
problems  than  did  HAP  and  MHAP 


\ 

i 
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TABLE  1  —  Input  Parameters Jor  Problem  5 


j 

v  «)> 

1  X/1 

*/? 

T 

0.0 

0.0 

2 

0.0 

0.0 

3  1 

6.0 

10.0 

/ 

",i 

a,  2 

4  j 

1.0 

3.0 

T"! 

0.0 

Tlo 

5  ] 

6.0 

10.0 

2 

2.0 

4.0 

6 

8.0 

8.0 

3 

6.0 

2.0 

7 

2.0 

4  0 

4 

6.0 

10.0 

8 

2.0 

4.0 

5 

8.0 

8.0 

0 

6.0 

10  0 

(a)  EF  Locations  < b)  Initial  NF  Locations 


K 

1 

2 

3 

4 

5 

1  2  3 

4 

5 

6 

7 

8 

9 

i 

10 

1.0 

1.0 

77" 

To 

1 

NJ.O  1.0 

To 

1  0 

1.0 

nr 

10 

1.0 

2 

11.0 

1.0 

1.0 

1.0 

1.0 

2 

! 

1.0 

1.0 

10 

1.0 

1  0 

1.0 

3 

1.0 

1.0 

1.0 

1.0 

1.0 

3 

1.0 

1.0 

1.0 

10 

10 

1.0 

4 

1.0 

1.0 

1.0 

1.0 

1.0 

4 

, 

V 

10 

1.0 

1.0 

1.0 

1.0 

5 

1.0 

1.0 

1.0 

1.0 

1.0 

5  1 

N 

1  0 

1.0 

1.0 

1.0 

6 

1.0 

1.0 

1.0 

1.0 

1.0 

6  : 

\ 

1.0 

1.0 

1.0 

7 

10 

1.0 

1.0 

1.0 

10 

7  | 

V 

1.0 

1.0 

8 

1.0 

1.0 

1.0 

1.0 

1.0 

8  j 

v 

1.0 

9 

1.0 

1.0 

10 

10 

1.0 

9  ! 

\ 

\ 

<c) 

w„  Weights 

(d) 

Weights 

\ 

TABL.F  2  —  Input  Parameters  for  Problem  6 


/ 

Cl,  2 

<<n 

•x/i 

V 

r 

2.0 

5.0 

./ 

2 

10,0 

20.0 

r 

5.0 

1  5.0 

3 

10.0 

10.0 

2 

5.0 

150 

(a)  F.F  Locations  <b>  Initial  NF  Locations 


v  i 

!  i 

| 

2 

3 

-  1 

0  16 

0.56 

0.16 

2  1 

0  16 

0  56 

0.16 

<c>  *■„ 

Weights 

<d)  vM  Weights 
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TABLE  3  —  Comparative  Test  Results  for  Six  Problems 


# 

MFLPV1 

_  «l0)  =  I0~4 

X' 

f(X') 

HAP 

MHAP 

HAP 

MHAP 

.. 

1 

564  (77) 

1661 

1381 

2027 

1407 

(1. 0,0.0) 

(1. 0,0.0) 
(1.0,00) 
(2.0,00) 
(2.0,00) 

38.990 

2 

148  (34) 

647 

. 

546 

4641 

2281 

(10.0,20.0) 

(10.0,20.0) 

186.798 

3 

770 

197 

43.351 

4 

31  (15) 

45 

45 

45 

45 

(2.832,2.692 

(5.096,6.351) 

67.250 

5 

223  (40) 

142 

114 

1763 

975 

(4.045,4.281) 
(4.045,4  281) 
(4.045.4.281) 
(4.045,4.281) 
(4.045,4.281) 
(4.045,4.281) 
(4.045,4.281) 
(4.045,4.281) 
(4.045.4.281) 

201  878 

6 

63  (7) 

242 

164 

3743 

1869 

(10.0,20.0) 

(10.0,20.0) 

TOTAL 

1092  (189) 

2824 

2320 

12989 

6774 

CPU 

0.07 

0.45 

0.50 

1.88 

1.48 

(c)  Five  of  the  six  problems  have  solutions  at  cluster  points.  This  appears  to  be  the  case 
in  many  other  problems.  This  suggests  that  methods  using  clustering  information, 
such  as  MFLPV1,  will  perform  better  than  methods  that  disregard  this  information 

10.  CONCLUDING  REMARKS 

To  date,  many  of  the  methods  designed  for  solving  the  multifacility  location  problem  have 
been  either  poorly  structured,  suboptimal  or  haphazard.  In  this  paper,  a  new  method  is 
developed  for  solving  the  multifacility  location  problem  involving  Euclidean  distances.  This 
new  method  can  easily  be  extended  to  accommodate  problems  involving  item  movements  that 
are  other  than  Euclidean  Computational  experience  shows  that  this  method  outperforms  tech¬ 
niques  currently  in  use  In  addition,  the  proposed  method  takes  full  advantage  of  the  structure 
of  the  location  problem 


Most  current  techniques  used  for  solving  location  problems,  including  those  proposed  in 
this  paper,  are  designed  to  minimize  an  unconstrained  objective  function.  This  is  an  incom¬ 
plete  treatment  since  most  practical  problems  involve  some  form  of  spatial  constraints  It  is 
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proposed  that  these  constraints  be  handled  and  the  performance  of  the  algorithm  improved 
through  the  use  of  projection  techniques.  This  approach  is  currently  being  investigated  by  the 
authors. 
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ABSTRACT 

The  fixed  charge  problem  is  a  mixed  integer  mathematical  programming 
problem  which  has  proved  difficult  10  solve  in  the  past  In  this  paper  we  look 
at  a  special  case  of  that  problem  and  show  that  this  case  can  be  solved  by  for¬ 
mulating  it  as  a  set  covering  problem  We  then  use  a  branch-and-bound  in¬ 
teger  programming  code  to  solve  test  fixed  charge  problems  using  the  set 
covering  formulation  liven  without  a  special  purpose  set-covering  algorithm, 
the  results  from  this  solution  procedure  are  dramatically  better  than  those  ob¬ 
tained  using  other  solution  procedures 


1.  INTRODUCTION 

The  linear  fixed  charge  problem  may  be  formulated  as: 


(1) 

Min 

iU 

nj 

(2) 

Subject  to 

x,  >  b,  it  /. 

/<  J 

(I) 

1  if  A,  >  0 

(3) 

y,  = 

0  otherwise  1  ( 

(41 

and  v,  ^  0, 

./  €  J 

for  /  =  j  I . m i  and  7=11.  ...  n) 

In  addition  to  continuous  costs,  the  variables  have  fixed  costs  which  are  incurred  when 
the  corresponding  continuous  variable  becomes  positive  All  cost  arc  assumed  to  he  nonnega¬ 
tive  Problem  (I  )  is  very  similar  to  the  standard  linear  programming  problem,  diflering  only  in 
the  presence  of  the  fixed  costs  In  spile  of  this  similarity,  it  hits  proven  to  be  tin  extremely 
difficult  problem  to  solve 

If  all  the  continuous  costs  are  zero,  we  have  a  special  case  of  the  fixed  charge  problem 
which  we  will  refer  to  as  problem  <PI  >  Problems  of  this  type  can  occur,  for  example,  when¬ 
ever  there  is  a  need  to  find  solutions  with  the  least  number  of  basic,  nondegenerate  variables 


62 1 


622 


P  MCKEOWN  AND  P  SINHA 


In  a  network  context,  Kuhn  and  Baumol  [4]  discuss  the  need  to  know  the  least  number  of  arcs 
necessary  to  carry  a  desired  flow.  Also,  in  the  survey  processing  field,  it  often  becomes  neces¬ 
sary  to  check  a  record  of  replies  to  a  questionnaire  and  to  determine  changes  to  make  the 
record  consistent.  In  this  case,  it  is  necessary  to  know  the  minimum  number  of  such  changes 
that  are  necessary  for  consistency.  Both  of  these  problems  are  examples  of  problem  (PF)  with 
the  former  having  the  standard  transportation  constraint  matrix  and  the  latter  having  a  general 
constraint  matrix  which  depends  upon  the  consistency  conditions. 

A  special  case  of  problem  (PF)  occurs  when  all  the  constraint  coefficients  are  nonnega¬ 
tive,  i.e.,  a,j  ^  0  for  all  j.  We  will  refer  to  this  problem  as  (PF+)  since  we  retain  the  condi¬ 
tion  that  all  continuous  costs  are  equal  to  zero.  In  this  paper,  we  will  demonstrate  a  solution 
procedure  for  (PF+)  based  on  a  revised  formulation  for  the  problem.  We  then  use  a  branch- 
and-bound  integer  programming  code  to  solve  the  revised  formulation.  The  results  from  this 
approach  will  be  compared  to  those  obtained  using  other  procedures. 

2.  A  REVISED  FORMULATION 

The  problem  in  which  we  are  interested  may  be  formulated  as  follows: 

(5)  Min  £  fjj 

jiJ 

(PF+) 

subject  to  (2)  -  (4) 

(6)  where  a0  >  0  for  i  €  /,  j  €  J 

(PF+)  remains  a  special  case  of  the  fixed  charge  problem  (F)  so  any  results  that  are  applicable 
to  problem  (F)  will  also  be  applicable  to  (PF+). 

Two  previously  derived  results  for  (F)  that  are  of  particular  interest  to  (PF  +  )  are: 


1)  any  optimal  solution  to  (PF+)  will  occur  at  a  vertex  of  the  continuous  constraint 
set  (2)  and  (4)  (Hirsh  and  Dantzig  [3]); 

2)  a  lower  bound,  L0,  on  the  sum  of  the  fixed  costs  can  be  found  by  solving  the 
set-covering  problem,  P&,  below  (McKeown  [5)). 


Min  L0-Z  fjy, 

(/*») 

(7)  Subject  to  Z&ijyj  ^  1.  ‘  €1 
JiJ 


(8)  yt  €  (0,1),  y  €  7 

|  1  if  0^  >  0 
”  j  0  otherwise, 


(9)  where 


i  \s v  soi c rio\  HiK  1 i\;  n  <  11  vk<;i  i'koiii  i  ms 


(.23 


We  will  combine  these  two  results  to  develop  a  solution  procedure,  the  essence  ol  which  is 
summarized  in  Theorem  1  below. 

THEOREM  1:  Let  =  (  / Ic,  =  1  in  an  optimal  solulion  to  I’J.  then  there  exists  a  feasi¬ 
ble  solution  to  (PE  +  )  such  that  ,v,  >  0  for  /  C  B £.  Furthermore.  this  solution  will  be  optimal 
for  (PF+). 

PROOF:  Given  an  optimal  solution  to  /JA,  we  must  show  that  there  exists  a  correspond¬ 
ing  solution  to  (PF+).  The  first  thing  to  note  is  that  each  column  of  the  constraint  matrix  (3) 
of  Pf,  in  has  at  least  one  nonzero  element  that  is  the  only  nonzero  element  in  that  row 
Otherwise,  the  set  would  be  over-covered  and  we  could  reduce  the  objective  value  of  hv 
removing  that  column  from  the  optimal  solution.  We  may  use  this  result  together  with  the 
nonnegativity  of  the  a,,  elements  to  construct  a  solution  to  (PF+)  using  B:. 

Assume,  without  loss  of  generality,  that  I B*\  =  A  and  that  the  decision  variables  have 

been  reindexed  such  that  |1 . k  \  C  B*.  i.e.,  the  lirst  A  variables  of  (PI  +  t  correspond  to 

the  optimal  basic  variables  of  PA.  We  can  now  construct  a  feasible  solution  to  <PI  +  I  using  the 
following  tw'o  rules: 

I )  ,\|  =  Max  \hj u,\\ 

a>  i  ^ 
i  €  / 


and  2)  xk  =  Max 


Max 

0.  a ,*  *  0 

I  €  / 


k  I 

h,  -  X.  "  v 


.  A  ^ 


This  proves  the  existence  of  a  solution  to  <PF+)  corresponding  to  B’  I  he  optimalitv  of 
this  solution  is  guaranteed  by  the  fact  that  both  </'„)  ami  (PI  +)  have  the  same  objective  value 
and  that  this  objective  value  for  Pn  is  a  lower  bound  on  (PI  +  )  Hence.  B‘  coi responds  to  an 
optimal  solution  to 

3.  COMPUTATIONAL  COMPARISONS 

Since  the  optimal  set  of  variables  for  (PF+)  can  be  found  by  solving  the  set-covering 
problem,  we  should  be  able  to  use  this  result  to  reach  quicker  solutions  to  (PI  +  i  We 
used  a  mixed  integer  programming  code  based  on  the  approach  of  lomlin  1 7 j  ,|S  extended  by 
Armstrong  and  Sinha  111  to  solve  the  set-covering  problems  Special-purpose  set-covering  algo¬ 
rithms  can  he  expected  to  perform  even  better.  Fixed  charge  test  problems  lust  genet ated  bv 
C  ooper  and  Drebes  (2 1  were  used  as  a  basis  of  comparison  between  this  set-coveimg  approach 
and  two  other  procedures.  I  he  first  such  procedure  is  a  branch-ami  hound  code  developed  hv 
McKcown  |(i|  specifically  for  fixed  charge  problems  while  the  second  procedure  used  the  same 
mixed  integer  code  as  before,  but  solved  (PF  +  I  as  a  mixed  integer  piohlem 

The  original  test  problems  were  of  dimension  s  »  Id.  but  l.ugei  piohlcnis  weie  genet. ited 
by  putting  these  smaller  problems  on  the  diagonal  I  sing  these  problems,  the  icsiilis  ot  nm 
comparisons  arc  shown  in  Table  I  below 
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TABLE  1 


Problem 

Set 

Size 

(m  x  n  ) 

Number 

of 

Problems 

Average  Solution  Time  per  Problem  in 

CPU  Seconds  on  CDC  70/74 

Armstrong 

and 

Sinha 

McKeown 

_  Set 
Covering 

Solutions 

1 

5  x  10 

12 

0.132 

0.049 

0.017 

10 

10  x  20 

6 

0.856 

0.345 

0.046 

4 

3 

1 5  x  30 

4 

3.039 

1.357 

0.101 

2 

From  the  table  we  can  see  that  the  set  covering  formulation  is  almost  three  times  faster 
than  the  best  alternative  approach  for  the  small  (5  x  10)  problems  ant)  up  to  13  times  faster  for 
the  larger  problems  (15  x  30).  We  have  also  noted  the  number  of  problems  for  which  the 
linear  programming  solution  was  integer  feasible  lor  the  set  covering  problems.  This  occurred 
in  over  half  of  the  cases. 

4.  CONCLUSIONS 

In  this  paper,  we  have  shown  that  a  fixed  charge  problem  with  nonnegative  constraint 
matrix  coefficients  and  all  continuous  costs  equal  to  zero  can  be  solved  by  solving  a  related  set- 
covering  problem  Computational  experience  confirms  that  this  procedure  yields  dramatically 
belter  solution  times  than  any  other  available  solution  procedure.  Even  quicker  solution  times 
can  be  expected  to  result  if  special  purpose  set-covering  codes  are  used. 
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ABSTRACT 

The  bounded  interval  generalized  assignment  model  is  a  "many -Ini -one'  as¬ 
signment  model  Each  task  must  be  assigned  to  exactly  one  agent,  however, 
each  agent  can  he  assigned  multiple  tasks  as  long  the  agent  resource  con¬ 
sumed  by  performing  the  assigned  tasks  falls  within  a  specified  interval  I  lie- 
bounded  interval  generalized  assignment  model  is  formulated,  and  an  algo¬ 
rithm  lor  its  solution  is  developed.  Algorithms  for  the  bounded  interval  vet 
sions  of  the  scmiassignmeni  model  and  sources-lo-uses  transportation  model 
are  also  discussed. 


1.  INTRODUCTION 

In  general  terms,  assignment  models  represent  problems  in  which  indivisible  tasks  are  to 
be  paired  with  agents.  Given  a  measure  of  utility  (or  disutiliiy)  associated  with  each  possible 
pairing,  the  objective  of  the  model  is  to  optimize  the  collective  utility  associated  with  assigning 
a  set  of  tasks  to  a  set  of  agents.  In  practical  applications,  the  number  of  tasks  typically  exceeds 
the  number  of  agents,  and  at  least  one  agent  must  be  assigned  two  or  more  tasks  if  all  tasks  are 
to  be  completed.  Examples  of  such  "many-tasks-for-one-agent"  problems  include  the  assign¬ 
ment  of  engagements  to  a  firm's  personnel  (20),  points  of  distribution  to  facilities  [15],  geo¬ 
graphic  units  to  district  centers  [21],  products  to  plants  111.  inventory  items  to  warehouses  [X|, 
harvestable  forest  compartments  to  a  labor  force  [121.  ships  to  shipyards  [111,  scholarships  to 
students  [181,  storage  compartments  to  commodities  [191.  jobs  to  computers  [31.  tiles  to  mass 
storage  devices  [2,131,  defect  checkpoints  to  inspectors  [17[.  and  trips  to  ships  [7|  The  feasi¬ 
bility  of  many-for-one  assignments  will  depend  on  the  agents'  abilities  to  complete  the  collec¬ 
tions  of  tasks  assigned  to  them.  That  is.  the  subsets  of  tasks  that  can  be  assigned  to  each  agent 
are  determined  by  the  total  amount  of  effort  available  to  the  agent  and  the  amount  of  effort 
that  each  individual  task  requires 


F/6  12/1 
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Several  many-for-one  assignment  models  have  been  developed  which  lake  into  account 
only  upper  limits  on  the  total  amount  of  effort  that  each  agent  may  expand.  Hach  of  these 
models  is  a  special  case  of  a  model  developed  by  Balachandran  [31  and  Ross  and  Soland  (14) 
called  the  generalized  assignment  model.  This  model  has  the  form: 

(U  iP)  minimize  :  =  £  ]£  c„.v„ 

lit  /£  J 

(2)  subject  to  £  x,j  =  1  for  all  j  €  7, 

if/ 

(3)  £  r„.\„  <  h,  for  all  /  €  /, 

(4)  ,v„  =  0  or  1  for  all  /  €  /,  /  €  7. 

where  /  =  [1,2 . ml  is  an  agent  index  set,  7=  [1,2,  ....  /;]  is  a  task  index  set.  c, 

represents  the  disutility  associated  with  an  agent  /',  task  j  assignment,  r„  >  0  denotes  the 
resource  burden  incurred  by  agent  /  in  completing  task  /,  and  b,  is  the  resource  available  to 
agent  i.  The  decision  variable  .v„  is  interpreted  as 

1  if  agent  /  performs  task  j 
0  otherwise 

Constraints  (2)  and  (4)  insure  that  each  task  is  uniquely  assigned  to  a  single  agent,  and  con¬ 
straints  (3)  insure  that  each  agent  expends  no  more  than  b,  resource  units  in  accomplishing 
assigned  tasks.  Differences  in  the  difficulty  of  tasks  and  differences  in  agents'  abilities  to  per¬ 
form  the  tasks  are  reflected  in  the  values  of  the  parameter  r,r 

The  special  cases  of  ( P )  place  various  restrictions  on  the  lorm  of  the  agent  resource  con¬ 
straint  (3).  Francis  and  White  19)  and  Barr.  Glover  and  Klingman  [51  have  addressed  the  prob¬ 
lem  in  which  constraints  (3)  have  the  form: 

(3a )  £  x„  ^  b,  for  all  /  €  /. 

/ 

Here  b,  denotes  the  number  of  jobs  agent  /  can  complete,  for  all  jobs  consume  only  one  unit  of 
an  agent’s  resource  when  the  agent  performs  the  task  (i.e.,  rn  =  )  for  all  /  €  /,  j  €  7  ).  The 
model  (1,2, 3a, 4)  is  a  generalization  of  the  standard  assignment  problem  of  linear  programming 
in  that  it  permits  an  agent  to  undertake  more  than  one  task.  It  has  been  called  the  generalized 
assignment  problem  by  Francis  and  White  and  the  semi-assignment  problem  by  Barr.  Glover, 
and  Klingman. 

Caswell  [6[,  DeMaio  and  Roveda  [81,  and  Srinivasan  and  Thompson  [16]  studied  the 
problem  in  which  (3)  is  replaced  by: 

(3b)  £  rr\lt  ^  b,  for  all  /  6  /. 

i‘  J 

The  model  (l,2.3b,4)  explicitly  considers  differences  in  the  difficulty  of  tasks  incorporated  in 
the  parameter  r,.  Srinivasan  and  Thompson  called  this  model  the  sources-to-uses  problem  to 
reflect  the  interpretation  of  the  model  as  a  transportation  problem  in  which  the  demand  at  the 
7-th  location,  r,,  is  to  be  supplied  by  a  single  source. 

Practical  considerations  frequently  require  that  the  agents  expend  a  minimum  total 
amount  of  effort  in  completing  assigned  tasks.  Placing  both  minimum  and  maximum  restric¬ 
tions  on  the  resources  each  agent  can  expend,  yield  assignments  which  neither  overburden  nor 
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underutilize  the  agents.  Such  restrictions  arise  in  most  personnel  planning  applications  1 20] . 
Managerial  policies  usually  require  an  equitable  distribution  of  work  across  agents.  Analagous 
restrictions  crop  up  in  other  contexts  as  well.  For  example,  in  machine  loading  models,  it  usu¬ 
ally  is  desirable  to  balance  machine  workloads  rather  than  allowing  some  heavily  loaded  and 
some  lightly  loaded  machines.  In  facility  location  models,  capacity  constraints  may  restrict  both 
the  minimum  and  maximum  size  of  a  facility  to  avoid  diseconomies  of  scale  associated  with 
plant  sizes  outside  of  a  reasonable  range,  to  permit  piecewise  linear  approximation  of  concave 
cost  functions,  or  to  restrict  both  the  minimum  and  maximum  number  of  facilities  115].  Simi¬ 
larly,  territory  design  procedures  for  problems  of  political  districting,  school  districting,  and 
sales  districting  require  an  equitable  distribution  of  some  entity  (such  as  voters,  minority  stu¬ 
dents,  or  sales  potential)  among  the  districts.  Finally,  in  some  applications,  upper  limits  on  the 
effort  an  agent  can  expend  may  be  irrelevant,  and  only  lower  limits  need  be  considered.  Such  a 
situation  arises  in  the  segregated  storage  problem  [19]  which  requires  only  that  a  minimal 
amount  of  storage  space  be  allocated  to  store  commodities  and  no  maximum  allocation  is 
specified. 

Thus,  from  the  standpoint  of  modeling  flexibility,  it  is  desirable  that  assignment  models 
consider  explicitly  upper  and/or  lower  bounds  on  the  efforts  agents  must  expend  in  completing 
assigned  tasks.  While  most  "many-for-one"  assignment  models  consider  upper  bounds,  lower 
bounds  have  largely  been  overlooked.  In  this  paper,  we  introduce  the  bounded  interval  gen¬ 
eralized  assignment  model  and  discuss  how  existing  algorithms  can  be  modified  to  accommo¬ 
date  lower  bounds  on  agent  workloads  for  this  model  and  its  special  cases. 

2.  THE  BOUNDED  INTERVAL  GENERALIZED  ASSIGNMENT  MODEL  AND 

ALGORITHMIC  CONSIDERATIONS 

The  bounded  interval  generalized  assignment  model  may  be  formulated  as  follows: 


(5) 

(P*)  minimize 

-’  =  XI‘ 

it  1  It J 

'//*;/ 

(6) 

subject  to 

II 

if 

1 

for  all ./  €  7. 

(7) 

V/ 

„  <  b, 

for  all  i  €  /. 

(8) 

X,J  = 

0  or  1 

for  all  /  €  /.  /€  J. 

Notice  that  (P*)  derives  from  (P).  Fortunately,  the  modeling  flexibility  achieved  through 
the  introduction  of  lower  bounds  a,  >  0  in  constraints  (3),  (3a),  or  (3b)  does  not  complicate 
significantly  the  computational  effort  required  to  solve  any  of  the  models  described  above. 
Rather,  as  we  shall  show,  straightforward  modifications  can  be  made  to  the  existing  algorithms 
for  the  semi-assignment  problem,  sourccs-to-uses  transportation  problem,  and  the  generalized 
assignment  problem.  The  interested  reader  should  consult  the  cited  references  for  the  details 
of  the  original  algorithms. 

In  the  case  of  the  semi -assignment  problem,  the  constraint  matrix  is  totally  uniniodular. 
and  integer  solutions  can  be  obtained  using  the  simplex  method.  To  impose  the  lower  limit. 
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one  need  only  add  an  upper  bounded  slack  variable  s,  <  -  a,  to  each  of  the  constraints  (3a) 

and  rewrite  them  as  equality  constraints.  Optimal  solutions  to  the  resultant  bounded  variable 
linear  program  will  be  integer  valued. 

Models  with  constraints  (3)  or  (3b)  are  not  totally  unimodular.  Hence,  the  solutions  of 
the  linear  programming  relaxation  (i.e.,  xu  >  0  for  all  ij)  need  not  be  integer.  Branch  and 
bound  approaches  have  been  developed  for  deriving  integer  optimal  solutions  which  solve 
linear  programming  relaxations  fpr  fathoming  and  to  compute  lower  bounds.  In  the  case  of 
(3b),  a  linear  programming  relaxation  is  the  standard  transportation  problem  [16];  and  in  the 
case  of  (3),  a  linear  programming  relaxation  is  the  generalized  transportation  problem  [3].  As 
in  the  case  of  the  semi-assignment  problem,  to  impose  constraints  (7)  or 

(7b)  aj  <  ^  rjXij  <  6,  for  all  /'  €  / 

jtJ 

in  a  linear  programming  relaxation,  one  need  only  add  upper  bounded  slack  variables 
s,  <  bj  —  aj  to  constraints  (3)  or  (3b)  and  rewrite  them  as  equality  constraints. 

The  algorithm  developed  by  Ross  and  Soland  [14]  for  the  generalized  assignment  problem 
does  not  solve  a  linear  programming  relaxation  to  determine  the  lower  bounds.  Instead,  a 
Lagrangian  relaxation  is  solved  in  the  form  of  a  series  of  separable  binary  knapsack  problems. 
The  Lagrangian  relaxation  has  the  form: 

(8)  (PK)  minimize  2A-££  c0  x0  +  ^X/l  -  X*/>) 

•  HljtJ  j(.J  HI 

subject  to  £  r0  Xjj  <  b,  for  all  /  €  / 

jij 

x>j  —  0  or  1  for  all  i  €  /,  j  €  J. 

The  value  of  each  X,  is  set  equal  to  c2j  ,  the  second  smallest  value  of  ctJ  for  all  i  €  /.  These  X, 
are  optimal  dual  multipliers  for  the  problem: 


(PL )  minimize 

£  £  CijXiJ 

HI  jtJ 

subject  to 

Zxu~  1 

for  all  j  €  7, 

HI 

0  <  x0  <  1 

for  all  i  €  /,  j  6  J. 

Thus,  determining  a  lower  bound  requires  two  steps.  First,  solve  (PL),  then  solve  (Px).  If  the 
primal  solution  X  -  (x0)  to_  (PL)  should  also  satisfy  (8),  then  Z  -  ZL  -  Zx,  and  (PK)  need 
not  be  solved.  Frequently,  X  will  not  satisfy  (8),  and  iPk)  must  be  solved  to  find  Zx. 

To  incorporate  the  lower  bounds  a,  into  the  algorithm,  one  need  only  replace  constraints 
(8)  by  constraints  (7)  giving  rise  to  the  problem  (Fx)  with  knapsack  constraints  bounded  both 
from  below  and  from  above.  Seemingly,  this  minor  modification  to  the  form  of  (Px)  should 
have  little  effect  on  the  algorithm.  However,  it  must  be  noted  that  (Pk)  will  involve  fewer  0-1 
variables  and  may  be  easier  to  solve  than  (P*).  The  reason  is  best  explained  by  considering  an 
equivalent  form  of  the  objective  function  of  (Pk): 

Zx  -  £xy  -  maximum  [£  £  (X,  - 

jtJ  \  HljiJ  I 


1(1)1  SIX  I)  ISM  H\  \l  ).l  SI  K  \|  I/I  |)  XSSHiWII  SI  I'Killll  I  M 
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Clearly,  wiih  constraints  (X).  one  can  sei  any  equal  to  zero  which  has  an  objective  function 
coefficient  (X,  -  c  ,  I  5.  0  Ihus.  using  the  values  of  X,  calculated  from  solving  (Pj ).  U\ ) 
reduces  to  a  problem  involving  at  most  n  0  -  I  variables.  Such  a  reduction  is  not  possible  for 
</*;>. 


In  addition  to  providing  a  lower  bound,  the  solutions  to  (Pr)  and  (P*)  may  be  used  to 
select  a  branching  (or  separation)  variable  for  defining  subsequent  candidate  problems.  As 
noted  above,  the  solution  to  ( P, ),  .V,  is  usually  not  feasible  to  (7).  In  essence,  the  solution  to 
(P*),  X  =  (,v„),  may  be  interpreted  as  recommending  changes  in  ,V  which  must  be  made  in 
order  to  satisfy  (7).  That  is,  it  is  possible  that  for  some  j  €  J ,  £.v„  =  0  to  avoid  overloading 

it  / 

any  agent  or  £  .v„  >  1  to  insure  every  agent  uses  a  minimum  amount  of  his  resource.  Those 

it 1 

variables  v„  with  an  optimal  value  of  one  indicate  agent-task  pairings  that  should  be  made; 
whereas,  those  .v„  with  an  optimal  value  of  zero  indicate  pairings  that  should  be  avoided.  Thus, 
these  variable  values  indicate  changes  that  will  reduce  the  aggregate  infeasibility  of  ,V  in  (7), 
and  they  are  helpful  in  choosing  a  branching  variable. 

To  formalize  the  concept  of  reducing  aggregate  infeasibility,  we  define  the  infeasibility  in 
constraint  /  prior  to  taking  a  branch  to  be 

I),  =  max  (0,  d,  | 

where  </,'  =  £  rux„ 

M  J 

(/,  —  (J,  —  r„.\„. 

n  J 

The  set  !'  =  1/  6  l\dl'  >  0)  identifies  those  constraints  (7)  for  which  .Vexceeds  the  upper 
bound,  and  /  =  (/'  €  l\d,  >  0)  identifies  those  constraints  (7)  for  which  V  fails  to  satisfy  the 
lower  bound. 

Suppose  I'  ^  H  and  A  6  (/  €  J  |.v„  =  I  and  i  €  /’I;  if  ,v,*  is  set  to  0  then  d,'  and  </, 
become: 

4'  =  X  r"*‘i  ~  ~ 

M  j 

</,  =  «,  -  X  r<t  *1/  +  '.A 
/«  ./ 

and  the  resulting  infeasibility  in  constraint  i  becomes 
/),*  =  max  (0,  </,*.  d,  |. 

Assuming  that  task  A  is  reassigned  to  the  second  least  costly  agent,  (say  agent  //,  where 
(  „<,  =  min  <  n  )  then  the  infeasibilitv  in  constraint  h  becomes 

Of 

l>t  =  max  (0.  d,' .  dh  \ 

1 4.  ~  £  rh,  'll  1  ~  hi,  +  r i,i 
:■  J 

di,  </„  ~  fl,,  ^tik‘ 

!•  / 


J 


where 
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Hence,  the  net  difference  in  total  infeasibility  is: 

A D*  =  (D,  +  D„)  -  (D,*  +  D£) 

If  A D*  >  0  then  setting  xlk  =  0  yields  a  reduction  in  aggregate  infeasibility,  and  if  AD*  <  0 
then  such  a  branch  will  not  reduce  aggregate  infeasibility. 

Similarly,  suppose  that  l  and  k  €  {7  6  J\xt)  =  0  and  /  €  /“);  if  x,k  were  set  to  1 
then  d*  and  d,  become: 

d,+  =  L  rn  -  b,  +  r,k 
i tJ 

d,'  =  a,  -  Z  r„  x,j  -  rlk 

jtJ 

and  the  resulting  infeasibility  in  constraint  /  would  be 
D,*  =  max  (0,  d,+  ,  d,~ j. 

If  x,k  is  set  to  1  then  task  k  is  assigned  to  agent  /'  and  agent  ik  relinquishes  it,  where 
4  =  min  c,k .  Hence,  the  infeasibility  for  constraint  4  becomes 

I 

D*  =  max  (0,  d*.  d~  1 

where 

d. f  =  T  r.  .  x,  ,  €  —  b,  -  r,  , 
l*J  * 

d,  — a,  -  y r, ,  x, ,  + r,  k- 

nj 

The  net  difference  in  infeasibility  is 

AD*  =  (D,  +  D,k)  -  (D*  +  D*  ) 

where  D,  and  D,s  are  the  infeasibilities  in  constraints  i  and  4  prior  to  any  branch  As  before,  if 
AD*  >  0  then  there  is  reduced  infeasibility  following  a  branch  on  variable  .v,*. 

Several  rules  for  selecting  the  branching  variable,  x,v,  are  formulated  as  follows: 

I.  a)  .x>;«  is  that  variable  for  which 

AD/.*=  max  I  AD/) 

U.  ,)>  II'  III 


where 

ir  =  1 

(/,./) 

|.v„  =  0  and  /  €  /*) 

H  =  1 

</.,/) 

|.v„  =  1  and  /  €  /"} 

h)  IfAD/.* 

=  0  in  a) 

then 

v,.,.  is  that  variable  for  which 

11 

V- 

1/  1  )  > 

max  |AD/| 

a,  ■  ir  1 1  K,  11  1 

where 

(>'  =  ( 

</../) 

|v„  =  1  and  /  (  /') 

a  =  ( 

0.  ./> 

|.v„  =  0  and  /  C  /  ). 
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II.  a)  -v,.,.  is  that  variable  for  which 


min  Ix.-cJ 

min 

C  II  i  ikj  I 

,/.y)6CG  W 

(/.  j)  €  G 

AD/  | 

b)  If  AD/  =  0  for  all  (/.  /)  €  G +  UG  then  a>,.  is  that  variable  for  which 


max 

TV ,  —  f,/ 

(/,./)  6  £+ 

fj < t>,  z  r„) 

J<  /, 

where  F*  =  {(/'.  ,/)|.v„  =  1  and  /  €  Dj, 

F,  denotes  the  set  of  tasks  assigned  to  agent  /  by  prior  branching. 

Rules  la  and  lb  are  designed  to  choose  that  variable  which  reduces  the  post  branch  aggregate 
infeasibility  by  the  greatest  amount.  Rule  Ha  conditions  the  choice  of  branching  variable  on  the 
additional  cost  incurred  per  unit  reduction  in  infeasibility.  Rule  lib  is  the  one  used  in  1 14k  the 
variable  chosen  by  this  rule  represents  an  agent-task  pairing  which  should  be  made  considering 
the  penalty  for  not  doing  so  weighted  by  the  fraction  of  the  agent's  remaining  free  resources 
consumed  by  the  assignment. 

As  the  algorithm  progresses  and  new  candidate  problems  (CPs)  are  defined  by  the  branch¬ 
ing  process,  the  additional  steps  given  below  may  be  taken  to  facilitate  fathoming.  These  steps 
are  specialized  adaptations  of  more  general  forcing  (or  variable  fixing)  tests  suggested  by  Balas 
[4]  and  Glover  1 1 01 . 

In  solving  any  (CP),  any  ,v,y  for  which  r,y  >/>,•-  Y  v, •,/•,•,  may  be  set  equal  to  zero 

/•  t . 

Here  /y  denotes  those  j  €  J  for  which  xy,  has  been  assigned  a  value  of  /ero  or  one  by  prior 
branching  or  variable  fixing  tests.  Similarly,  if  there  is  an  .v,y  for  which  a,  -  Y 

<•  /  • 

.\y,/y,  >  Y  then  ,v,y  must  be  set  equal  to  one  in  the  solution  to  (CP)  These 

!•  j  t; 

variable  forcing  tests  may  subsequently  result  in  other  variables  being  forced  to  zero  or  to  one 
when  all  of  the  resultant  implications  are  considered.  Moreover,  forcing  certain  variables  to 
zero  or  to  one  in  the  solution  to  (C’P)  may  affect  the  values  of  some  of  the  A  obtained  from 
solving  (P*).  This  change  mav.  in  turn,  increase  the  value  of  the  lower  bound  provided  b\ 

<p;>. 


Another  test  may  be  used  to  check  the  feasibility  of  IP*)  (or  any  candidate  subproblem) 
Summing  the  constraints  (7)  together  yields  the  constraint  19): 

<9>  ^  =  Iffi  ^  I  I  rnxn  ^  Z  h>  =  H 

i 1  I  ,<  I  I<  .1  n  ! 

This  new  constraint,  together  with  constraints  (6),  implies  that  for  any  feasible  solution  to  (/”  ) 
we  must  have: 

Y.  r\  ^  .4  and  £  r  ]  <  H 

,  •  j  , . 


GO) 
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where 

r]  =  max{r„|  and  r"=  min  (r„). 

ft  /  I<  I 

The  values  necessary  for  the  tests  (10)  can  be  updated  easily  as  part  of  the  branching  process  in 
order  to  apply  this  test  to  each  (C'P). 

The  algorithm  terminates  in  the  usuai  way  when  all  candidate  problems  have  been 
fathomed. 

3.  CONCLUSION 

This  note  has  described  an  efficient  branch  and  bound  algorithm  for  the  bounded  interval 
generalized  assignment  problem.  The  algorithm  serves  as  a  useful  tool  for  solving  a  large 
number  of  applications  of  this  assignment  model,  a  representative  sample  of  which  is  men¬ 
tioned  in  the  introduction. 
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ABSTRACT 

In  this  paper  we  are  concerned  with  several  random  processes  that  occur  in 
M/G/l  queues  with  instantaneous  feedback  in  which  the  feedback  decision  pro¬ 
cess  is  a  Bernoulli  process.  Queue  length  processes  embedded  at  various  times 
are  studied  It  is  shown  that  these  do  not  all  have  the  same  asymptotic  distri¬ 
bution.  and  that  tn  general  none  of  the  output,  input,  or  feedback  processes  is 
renewal.  These  results  have  implications  in  the  application  of  certain  decompo¬ 
sition  results  to  queueing  networks 


I.  INTRODUCTION 

In  this  paper  we  are  concerned  with  several  random  processes  lhat  occur  within  the  class 
of  M/G/l  queues  with  instantaneous  feedback  in  which  the  feedback  decision  process  is  a  Ber¬ 
noulli  process.  Such  systems  in  the  case  G  =  M  are  among  the  simplest,  nontrivial  examples 
of  Jackson  networks  (8].  Indeed,  they  are  so  simple  that  they  are  usually  dismissed  from  con¬ 
sideration  in  queueing  network  theory  as  being  obvious.  We  will  show  that  far  from  being 
obvious,  they  exhibit  some  important  unexpected  properties  whose  implications  raise  some 
interesting  questions  about  Jackson  networks  and  their  application. 

In  particular,  Jackson  l8l  observed  lhat  in  his  networks  the  vector-valued  queue  length 
process  behaved  as  if  the  component  processes  were  independent,  M/M/I  systems  Since  those 
results  appeared  there  has  developed  a  mythology  to  explain  them.  These  arguments  usually 
rest  on  three  sets  of  results  that  are  well  known  in  random  point  process  theory:  superposition, 
thinning,  and  stretching.  By  examining  the  network  flow,  it  will  be  shown  that  the  applications 
of  these  results  are  inappropriate  for  queueing  networks  with  instantaneous,  Bernoulli  feedback 
These  flows  are  considerably  more  complicated  than  one  expects  based  on  such  arguments. 


Mhe  research  was  supported  under  ONR  ( Dniracls  MMM)14-7S(  -04d2  f\K042  2^M  .tndNoooM  ~~  (  iT4'  <\Ro42  2‘*M 


636 


Kl  l)ls\l  >  I)  (  MONK  kl  I  \NI)  tt  SIMON 


li  is  shown  that  in  general,  both  the  input  and  output  processes  of  the  M/M/1  queue  with 
feedback  are  Markov -renewal,  and  the  kernels  of  those  Markov-renewal  processes  are  given. 
The  output  of  the  M/G/l  queue  with  feedback  is  also  Markov-renewal,  and  that  kernel  is  given. 
It  is  shown  that  in  general  these  processes  are  never  renewal.  The  implications  of  these  facts 
are  discussed  in  Section  4. 

1.1  The  Problem  and  Notation 


We  assume  the  usual  apparatus  of  an  M/G/l  queue  with  unlimited  waiting  capacity.  The 
new  idea  is  that  a  unit  which  has  received  service  departs  with  probability  q  and  returns  for 
more  service  with  probability  p  />  +  (/=  1 .  Without  loss  of  generality  for  the  processes  stu¬ 
died  here,  the  returning  customer  can  be  put  anywhere  in  the  queue. 


To  establish  notation  it  is  assumed  that  the  arrival  process  is  a  Poisson  process  with  param¬ 
eter  K  >  0.  The  arrival  epochs  are  the  elements  of  {H„:  n  —  1,2,  ...).  Service  times  are 
independent,  identically  distributed,  nonnegative,  random  variables,  S„  with 

Pi  | .S',  sj  i]  =  //It).  i  >  0. 

m,i  <  °°. 

We  define  //Ms),  the  l.aplace-Stieltjes  transform  of  // (/).  by 
//Ms)  =  f  t  e  M///(t>.  Re  ,v  ^  0. 

The  arrival  process  and  service  times  are  independent  processes. 


Service  completions  occur  at  ’/'11  <  f)  <  7\...  called  the  ot/ipui  epochs,  l.et 


>Y  = 


0.  if  the  n- th  output  departs, 

1,  if  the  11  -th  output  feeds  back. 


i  )„|  is  a  Bernoulli  process. 


Idements  of  the  subset  |t„!  C  l7'„!  are  called  the  departure  epochs  and  are  the  times  at 
which  an  output  leaves  the  system.  The  elements  of  the  subset  (r„|  C  {  T„\  are  called  the  feed¬ 
back  epochs  and  are  the  times  at  which  an  output  returns  to  the  queue.  (t„|  U  (t„1  =  [T„\. 

The  times  /',’,  are  the  times  at  which  a  unit  enters  the  queue.  { 77,1  is  called  the  input  pro¬ 
cess.  !/;;}  =  («;,}  u  1  rj 

There  are  live  queue  length  processes  to  be  studied.  They  are  closely  related  as  will  be 
shown.  L.et  Qit)  be  the  queue  length  (number  in  the  system)  at  t.  Then, 
C>,  in)  -  K  -  0).  Q:  in)  =  Q  ( T‘„  -  01.  Ql  <//)  =  Q(T„  +  0);  (?4*  in)  =  Qit„  +  0)  are 
respectively  the  embedded  queue  lengths  at  arrival  epochs,  input  epochs,  output  epochs,  depar¬ 
ture  epochs 

2.  QU  IT.  I.F.MiTH  PRO<  KSSKS 

The  queue  lengths  listed  in  Section  1.1  arc  closely  related.  The  steady  state  versions  of 
and  |(7t'(»)}  are  of  primary  concern  They  are  studied  in  Sections  2.1  and  2.2 
separately.  They  are  related  to  the  other  processes  in  Section  2.3  The  important  special  case 
for  G  •  M  is  then  studied  in  2  4 


\I/C,I\  Ql'I-l'E  WITH  INS TaNTANHoi is  III -KN()l<|  I  I  I  I  I  I1H\(  K 
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feedback  process 


Bernoulli 

switr'i 


arrival 

process 


input 

process 


output 

process 


Rill  Kl  | 


2.1  The  i£>4+  (n)l  Process 


There  are  several  ways  10  study  this  process.  The  following  appears  to  he  direct,  correct, 
and  may  help  explain  why  these  feedback  problems  have  received  such  little  attention  in  the 
queueing  literature.  First,  it  is  clear  that 


i„  i  +  5,;.  if  (?;  («  -  1)  >  0. 
f»  i  +  4,  +  S,',.  if  (?44  (tt  -  |)  =  0. 


Here  S'„  is  the  total  service  time  consumed  between  the  (n  -  1)  -  si  and  «-th  departure.  I„  is 
the  idle  time  following  V  t  when  Q+  (n  -  l)  =  0.  For  the  M/G/l  queue,  the  / 's  are  indepen¬ 
dent,  identically  distributed,  random  variables  that  are  exponentially  distributed  with  parameter 
A 


Without  loss  of  generality,  since  customers  are  indistinguishable, 

S'„  =  S,  +  Si  +  . . .  sm , 

where  m  is  the  number  of  services  performed  between  the  (n  -  1)  -  si  and  ri-th  departure. 
Since  I  Tj  is  a  Bernoulli  process,  m  is  geometrically  distributed  and  it  follows  that  (.S'/,)  is  a 
sequence  of  independent,  identically  distributed,  random  variables.  Thus,  the  Laplace-Sticltjcs 
transform  of  the  distribution  function  of  S'„  is  easily  found  to  be 

O’*  (  s )  =  qH*(s )/ 1 1  -  pH*  ( s ) | . 


Using  standard  embedded  Markov  chain  methods  13.  167-174)  one  finds  that  the  probabil¬ 
ity  generating  function  of  J"J',  the  limiting  probability  distribution  of  ( Or  tut),  is  given  bv 


(1) 


7r'(0)  (c  -  1)  c,*{\  -  \z) 
z  ~  G*(A  —  A: ) 


and 

<21 


rr'(0)  =  I  -  A Sn\/q. 
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If  one  is  willing  to  assume  that  the  M/G/l  queue  with  instantaneous,  Bernoulli  feedback 
has  a  queue  length  process  which  asymptotically  has  the  same  distribution  as  another  M/G/l 
queue  without  feedback,  then  (1)  and  (2)  follow  immediately.  This  assumption  is  valid  since  if 
customers  feedback  to  the  front  of  the  queue,  the  total  service  time  of  the  n-th  customer  is  S  . 
(Sj  is  a  sequence  of  independent,  identically  distributed,  random  variables  with  mean  E\S„\/q. 
Alternatively,  one  can  argue  that  the  M/G/l  queue  with  feedback  (as  defined  here)  has  the 
same  asymptotic  distribution  for  its  queue  length  process  as  an  M/G/l  queue  without  feedback 
if  one  takes  the  arrival  process  parameter  in  the  latter  case  to  be  k/q.  Indeed,  both  of  these 
assumptions  and  several  others  that  are  used  to  "prove"  that  these  queues  with  feedback  are 
trivial  have  now  been  proven  by  the  arguments  leading  up  to  (1)  and  (2).  That  these  argu¬ 
ments  can  be  applied  more  generally  is  easily  proven.  In  the  remainder  of  this  paper,  in  Takacs 
[  1 01  and  in  Disney  [6]  it  is  shown  that  while  these  arguments  may  imply  that  the  study  of 
queue  lengths  at  departure  times  is  trivial,  the  same  cannot  be  said  for  other  processes  of 
interest. 


2.2  The  1 Q)  («)}  Process 

T  his  is  the  queue  length  process  embedded  at  output  points.  Since  {t„ I  C  j 7„  1 .  (£>;  (/,)! 
is  a  process  on  a  coarser  grid  than  (CK  Dill.  Since  one  is  ultimately  to  be  concerned  with  both 
{(2/  (/i))  and  \T„  -  /'„  |).  the  following  study  is  for  the  joint  process  { C^T  ( n ).  T„  —  ,|. 

The  marginal  results  for  {(2/  (//)[  then  will  be  easy  to  determine. 

THEOREM  1:  The  process  ((2,+  («),  T„  -  T„  ,1  is  a  Markov-renewal  process  with  kernel 
A  ( i.j.x )  =  Pr{Q?  in)  =*=  ./',  T„  -  T„  ,  <  .v|(2,+  (/;  -  I)  =  /).  If  one  defines 


P,(.v)  =  UylV  *7 


./  =  0.1,2 . 


■I  I  /.  /.  x  )  = 


if./  <  /  -  1, 


J~  (P,  | (y)p  +  Pt  ,  | (y)q)dH(y).  if  /  ^  0, 


£'  <1  -  c  AU  ")  (P,  ,(.iT/»  +  P,(y  )</)</«  (y).  if  I  =  0, 

./  >  I). 

f  <1  -  c  A"  11  )P„(v )</<///(. i  >,  if./  =  ,  =  0. 


/  >  i  -  1 . 


PROOI 


T„  -  ,  = 


•S„.  if  (2 /(/;-  I)  >  0, 
A,  +  S„.  if  (/,  -  1)  =  (). 


where  /,  is  the  exponentially  distributed  idle  time  preceeding  S„  if  C>,*  in  -  1)  =  0.  The  result 
then  follows  directly  using  arguments  as  in  (5).  13 

As  v  —  A  ( ij.x )  —  A  (/,./)  the  one  step  transition  probability  for  the  { (2,'  («))  process. 
Then  using  standard  embedded  Markov  chain  results  13.  167-1741  one  can  show  that  the  proba¬ 
bility  generating  function  g(r)  for  the  limiting  probabilities  i r(  /l  of  (2/  in)  are  given  by 

( 3 1  ( - )  =  ,r^)  ~  1 1  </>c//*( A  -  \c)  +  qH‘(k  - 

K  ‘  z  -  pzUMk  -  k:)  -  qH'ik  -  kz) 


and 

(4) 


\m,n  qi 'i  t  i  with  instantasi  ot's  hi  knoi  i  i  i  i  i  i  dh\<  k 
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2.3  Other  Queue  Length  Processes 

The  queue  length  and  limiting  probabilities  for  the  queueing  processes,  { C? i  (nil, 
1(?2  (n)l  now  follow  from  a  theorem  found  in  Cooper  [3,  155],  From  this  it  follows  that 
{(3(f)),  (£>,  (n)|,  and  {0/ (n)|  are  asymptotically,  identically  distributed  (see  Cooper  [3,  65]) 
and  Q2  (n),  and  ((?/(«)]  are  asymptotically,  identically  distributed.  Clearly,  {QJ  (n))  and 
l(?3+(n))  are  not  asymptotically,  identically  distributed.  That  [Qf  (n)]  and  {£>,’  («)]  are  not 
asymptotically,  identically  distributed  can  be  seen  as  follows.  First,  in  the  set  up  of  studying  the 
U?3+  (n)J  process  one  must  decide  how  to  count  the  feedback  customer  when  he  appears.  The 
clean  way  to  do  this  is  to  use  Y„  as  defined  in  Section  1.1  and  ( n )  as  the  number  in  the 
queue  not  including  the  outputting  customer.  Then  one  can  study  the  process  |  Y„,  *£>,*  («)). 
Indeed,  this  is  precisely  the  direction  used,  for  example,  in  d'Avignon  and  Disney  |4],  Then 
the  {03+  (n))  of  Theorem  1  above  would  be  the  {*Q /  ( n )  +  Y„ )  process  of  [4],  It  then  follows 
that  ( *Qj  («))  and  re  asymptotically,  identically  distributed.  Thus,  if  one  does  not 

count  the  feedback  customc  t  the  queue  length  process,  the  queue  length  processes  defined  in 
Section  1.1  are  all  asymptoticaly,  identically  distributed. 

2.4  The  M/M/l  Case 


If  one  assumes  that  the  service  time  distribution  is 


HU)  «  1  -  e  “',  t  >  0, 

some  further  clarification  is  possible  here.  From  the  results  of  Jackson  18], 


A 

1  -  — 

9M 

9M 

From  (3)  and  (4)  one  obtains 


tt  (0)  =  q\ 


1  - 


A 

9M 


1  -  — 

A 

/-  1 

,  A 

P  +  — 

9M 

4M 

Comments  in  Section  2.3  explain  this  difference  between  tt(  J)  and  7 t'(  /  ). 


3.  FLOW  PROCESSES 


To  further  clarify  the  problems  here,  it  is  useful  to  study  the  (low  processes  in  this  sys¬ 
tem.  There  are  five  processes  of  interest:  the  arrival  process,  the  input  process,  the  output  pro¬ 
cess,  the  departure  process,  and  the  feedback  process. 

There  have  been  some  questions  since  the  publication  of  the  Jackson  results  concerning 
the  interpretation  of  his  results  (21.  In  his  paper  Jackson  showed  that  for  his  networks  the  joint 
limiting  probability  for  the  vector  of  queue  lengths  at  each  server  could  be  factored  into  limit¬ 
ing  probabilities  for  the  queue  length  at  each  server.  This  imples  that  the  queue  lengths  are 
independent  in  the  limit.  Furthermore,  the  marginal  limiting  probabilities  were  found  to  be 
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precisely  those  of  an  Ml  Ml  I  queue.  Burke  (21.  has  argued  that  the  Jackson  results  are  surpris¬ 
ing.  Burke's  argument  is  based  on  showing  that  the  input  to  a  single  server  queue  with  feed¬ 
back  is  not  Poisson  because  the  interinpul  times  (our  { 7'/,  -  7”  |!>  are  not  exponentially  distri¬ 
buted.  (2]  gives  the  precise  result 

Pr{  T'„  -  7,;  |  ^  /)  =  I  —  W  ~  A  c  A'  -  e  ^  0. 

p  —  A  p  —  A 

In  this  section  we  will  study  some  of  the  (lows  in  this  network  and  show  indeed  that  the 
Jackson  results  are  surprising. 

3.1  Departures 

The  departure  process  (rj  can  be  studied  as  in  Disney,  Farrell,  deMorais  (51  upon  using 
the  mapping  in  Section  2.1.  Thus  we  know  that  whenever  (Sj  is  a  renewal  process  with 
exponential  distribution  this  departure  process  is  a  renewal  process,  and  is  a  Poisson  process. 
This  is  the  Jackson  case.  So  we  conclude  that  the  departure  process  from  the  Jackson  network 
is  a  Poisson  process. 

From  the  results  of  Section  2.1  it  would  seem  possible  that  the  departure  process  is  Pois¬ 
son  even  if  S„  is  not  exponentially  distributed.  The  result  that  is  needed  for  the  results  of  |51 
to  follow  is  that  S,'  be  exponentially  distributed  (since  it  is  known  that  |S,’)  is  a  sequence  of 
mutually  independent,  identically  distributed,  random  variables). 

I.FMMA  I:  The  departure  process  from  the  MKI/I  queue  with  feedback  is  a  renewal  pro¬ 
cess  if  and  only  if  S„  is  exponentially  distributed  for  every  n.  In  that  case  the  departure  process 
is  Poisson. 


PROOF:  From  Section  2.1  we  have  (»*  (x ),  the  l  aplace-Stielljes  transform  of  the  distribu¬ 
tion  functions  of  .S',',  is  given  bv 


<7*(.v) 


qll Ms) 

I  -  /)//*(  s  I 


From  15] .  when  the  queue  capacity  is  infinite  the  departure  process  will  be  a  renewal  pro¬ 
cess  if  and  only  if  5^  is  exponentially  distributed  with  parameter  a,  and  will  be  Poisson  in  that 
case.  But  this  implies  that  H*(s)  must  satisfy 

a/ (a  +  v)  =  <///*  (.v )/ 1 1  -  pi  I*  ( v )  I . 

The  only  solution  here  is 


a/q  +  v 

which  implies  I  Hi)  is  exponential. 


3.2  Outputs  and  Inputs 


From  Section  2.2  it  is  clear  that  (he  output  process  is  a  Markov-rcncwal  process  whose 
distributions  are  given  by  A  ( i.j.x ).  From  these,  the  following  results  are  obtained. 

THFORFM  2:  The  output  process  { 7„  -  T,,  ,1  is  a  renewal  process  if  and  onlv  if  q  =  I 
and  HU)  =  I  -  c 
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PROOF:  If  q  —  1  and  Hit)  —  1  —  e  the  output  process  and  departure  process  are 
identical  processes.  Furthermore,  the  processes  are  both  departure  processes  from  a  M/M/1 
queue  without  feedback.  From  [5]  we  have  that  this  departure  process  is  a  Poisson  process  and 
"if1  follows.  To  prove  "only  if  we  consider  the  contrapositive  statement  and  assume  q  &  1 
(The  other  side  of  the  contrapositive  would  have  Hit)  ^  1  —  e  '".  But  then  “only  if  follows 
trivially  from  15],  Thus,  we  need  only  consider  the  case  of  q  ^  1.)  Equations  (3.1)  and  (3.2) 
in  [51  can  be  modified  in  such  a  way  that  one  can  show  that  if  q  ^  1,  there  is  no  solution  to 
both  of  those  equations  simultaneously.  Then  using  the  same  arguments  as  in  [5)  one  has  that 
[T„  -  7”,, _ | )  is  not  a  renewal  process  and  therefore  “only  if  is  proven.  □ 

To  be  more  specific.  Theorem  2  can  be  particularized  as 

COROLLARY  1:  The  output  process  {T„  —  T„  ()  for  the  M/M/1  queue  is  a  Poisson  pro¬ 
cess  if  and  only  if  q  =  I.  One  can  prove  this  result  (in  fact  it  is  obvious)  directly  from 
Theorem  2.  The  following  is  an  alternate  proof  that  exposes  a  bit  more  of  the  properties  of 
these  systems.  Again  we  use  a  contrapositivc  proof  for  "only  if. 

PROOF:  Define 

Fix)  -  Pr\T„  -  T„  ,  ^  .v). 

Fix)  =  nAU  where  U  is  a  column  vector  all  of  whose  elements  are  1,  n  is  the  vector  of  limn¬ 
ing  probabilities  given  in  Section  2.4  for  [Qf  (;/))  and  A  is  the  matrix  of  A  ii.j.x).  Then  from 
Theorem  I  one  obtains  after  some  algebraic  manipulations: 

(5)  Fix)  —  q-  —  f' [I ,’|(///(t)+  p+—  // ( v ) 

/J.  fJL 

for  any  M/G/\  queue  with  instantaneous,  Bernoulli  feedback. 

For  //(,t  )  =  1  -  c  M',  it  follows  that 

(6)  Fix)  —  1  -  qtLr  X  c~A> - v  ^  0. 

/x  —  X  fj.  —  X 

Thus,  single  intervals  are  not  exponentially  distributed  and  the  output  process  is  not  a  Poisson 
process  if  q  ^  I.  On  the  other  hand  if  q  —  I.  then  we  fulfill  the  conditions  of  I  heorcm  2 
Hence,  IT,,  -  T„  \)  is  a  renewal  process.  But  from  (6)  this  renewal  process  has  exponentially 
distributed  intervals  and  thus  is  a  Poisson  process.  !  ' 

Formula  (6)  was  previously  found  by  Burke  [2)  for  the  distribution  of  times  between 
inputs.  The  input  process  can  be  analyzed  as  follows: 

THEOREM  3:  ir  Hix)  =  1  the  process  |C>:  («>,  T'„  ~  T'„  ,|  is  a  Markov- 

renewal  process  with  kernel 

Yii.j.x)  =  /M£>2  in)  -  j,  T'„  —  T'n  ,  <  x|(?}  (n  -  I)  =  i] 


given  by 
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where  <///'"+1,(s)  =  &(Ps)"e  M'  ds. 

«! 

PROOF:  Clearly,  if  j  >  /  +  1  then  YU.j.x)  =  0.  If  j  -  0  then  YU.j.x)  is  the  probability 
that  the  i  +  1  customers  in  line  all  depart  before  *  and  the  first  arrival  occurs  after  the  last 
departure,  but  before  .r,  or,  the  first  /  -  1  customers  depart,  but  the  last  one  feeds  back  before 
x,  and  there  are  no  arrivals  while  this  is  happening. 

If  1  <  j  <  /  then  Y(iJ,x)  is  the  probability  that  i  -  j  +  1  customers  depart  before  .v.  no 
arrivals  occur  during  this  time,  but  between  'he  last  departure  and  a,  an  arrival  occurs  before  a 
departure;  or,  /  —  /  +  1  customers  are  served  before  ,v,  the  first  /'  -  j  depart,  the  last  one  feeds 
back,  and  there  are  no  arrivals  while  this  is  happening. 

If  j  =  /  +  1  then  YU.j.x)  is  the  probability  that  there  is  an  arrival  before  a  and  no  depar¬ 
tures  before  x.  Since  Yii.j.x)  never  depends  on  (A);  A  <  n  -  1)  or  {7)':  A  <  /;),  the 
process  I Qj  in),  T'„  —  T‘„  ||  is  a  Markov-renewal  process,  r 

Now,  if  Yix)  is  the  matrix  whose  elements  are  YU.j.x),  tt  is  the  vector  of  probabilities 
found  in  (3)  and  U  is  a  vector  all  of  whose  elements  are  I  then  it  is  easy  to  see  that 

Fix)  =  Pr\T„  -  r,;_,  x)  =  t r  Yix)U 
and 

Gi.v.r)  =  PriT'  -  r„.x  <  a,  r,;+t  -  r;  <  y]  =  n  ylx)  yuuu. 

where  Fix)  is  the  Fix)  given  by  (6).  Of  course,  if  (77,  -  7*/,  - 1 }  is  to  be  a  renewal  process  then 
it  is  necessary  (but  not  sufficient)  that 

G’ix.y)  =  Fix)  Fir). 

From  this  we  can  conclude: 


COROLLARY  2:  The  input  process  to  the  M/M/I  queue  with  instantaneous,  Bernoulli 
feedback  is  not  a  renewal  process  unless  q  —  1. 

PROOF:  If  q  —  1  then  the  input  process  is  just  the  arrival  process  which  is  Poisson. 

If  the  input  process  is  a  renewal  process  for  q  ^  I  then  it  must  be  true  that 
V-V.  it  Yix)U  =  Fix) 

V.v.y;  77 ■  Y  ix)  Y  i  v)  U  —  Fix)  Fi \)  where 
Fix)  is  given  by  (6)  and  U  is  a  column  of  l’s.  Thus, 

7 T  Y  (.V  ) 


V.v.y ; 

Some  algebra  yields 
7 T  Fix) 


Fix  i 


K  i  )  L  =  0. 


Fix 


Yiv)U  = 


Fix)  -  II  -  f 


Fix  1 


q  *  I. 


Fix)  -  <1 


-  1>K  c  *' - CP—,-  <"  +  pc  'c'"' 

fx  —  A  n  —  K 


JLIL 


M  -  A 


<;  w  >  „  PR _ F  k  »  _  _  PJL _ i  _  () 

H  —  K  n  -  A 


L 


j 
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Thus,  to  show  that  the  input  process  is  not  renewal,  it  suffices  to  show  that  lor  some  r, 

_ Eh _ (.-A  >■  _  _ EB. _ e  mi  +  pc '  •**  ’  * 11  0. 

fi  —  X  fx  —  X 

The  third  term  of  the  Taylor  expansion  of  this  expression  is 

P *  A 1  _  £B  P2  +  p(m  +  k)  =  PKP  ^  0 

P  -  \  2  p  -  \  2  2  2 

so  (by  ll,  198]  for  instance),  it  cannot  be  identically  zero  unless/?  =  0  ( i.e. .  q  —  1). 

It  seems  obvious  that  the  arrival  process  and  feedback  process  are  not  independent 
processes.  One  can  show,  using  the  above  arguments: 

COROLLARY  3:*  Either  the  feedback  process  is  not  a  Poisson  process  or  the  arrival  pro¬ 
cess  and  feedback  process  are  not  independent  processes  (or  both)  for  the  M/M/I  queue  with 
instantaneous,  Bernoulli  feedback. 

PROOF:  This  result  follows  immediately  from  Burke's  result  |2]  on  the  distribution  of 
the  interinput  arrivals.  For  if  the  feedback  process  is  both  independent  of  the  arrival  process 
and  is  itself  a  Poisson  process,  the  input  process  is  Poisson.  Thus,  Burke's  result  contradicts 
the  assumption,  n 

3.3  Feedback 

The  feedback  stream  seems  to  be  quite  difficult  to  work  with.  From  the  previous  section 
we  know  that  it  is  either  not  independent  of  the  arrival  stream  or  not  a  Poisson  stream 
Melamed  [9]  has  shown  that  this  feedback  process  is  not  a  Poisson  process.  We  conjecture 
further  that  it  is  not  independent  of  the  arrival  process.  If  so,  then  the  known  superposition 
theorems  cannot  be  used  to  study  feedbacks  in  terms  of  the  arrival,  feedback  and  input 
processes. 

Since  the  feedback  stream  is  the  result  of  applying  a  filter  to  the  Markov-renewal  output 
process,  it  is  itself  Markov-renewal  on  the  state  space  {1,2,  . . .).  Even  in  the  M/M/\  case,  the 
form  of  the  feedback  stream  does  not  appear  to  reduce  to  that  of  any  simpler  process. 

4.  CONCLUSIONS 

There  are  several  conjectures  that  one  can  pose  concerning  networks  based  on  the  results 
of  this  paper.  First  with  respect  to  queue  length,  busy  period,  and  departure  processes,  if  one 
adopts  the  "outsiders"  view  [3]  these  processes  appear  to  be  those  generated  by  an  Ml 071 
queue  without  feedback.  However,  if  one  adopts  the  "insider"  view  the  queue  length  process 
does  not  appear  to  behave  as  seen  by  the  "outsider." 

Flow  processes  in  this  network  cannot  be  explained  by  appeal  to  superposition,  stretching, 
and  thinning  results  for  Poisson  processes.  The  requisite  independence  assumptions  both 
within  and  between  streams  of  events  are  not  satisfied  here.  Thus,  one  cannot  assume  that 
these  queues  which  act  "as  if'  they  were  Ml  Ml  1  queues  to  the  "outsider"  are  Ml  Ml  1  queues 
to  the  "insider."  In  particular,  this  hints  at  the  possibility  that  in  these  networks,  even  as  simple 
as  Jackson  networks,  any  attempt  to  decompose  the  network  into  independent  M/Ml  1  queues 
is  doomed  to  failure.  This  decomposition  must  account  for  the  internal  flows  and  these  not 
only  appear  to  be  non  Poisson,  they  are  nonrenewai  and  arc  dependent  on  each  other 
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In  (9),  it  is  shown  that  in  the  Jackson  structure,  the  flow  along  any  path  that  returns  a 
customer  to  a  node  that  he  has  previously  visited  is  not  only  not  Poisson,  it  is  not  renewal. 
Thus,  if  Jackson  networks  have  loops,  (direct  feedback  as  in  this  paper  being  the  simplest 
example),  they  cannot  be  decomposed  into  sub-networks  of  simple  Ml  Ml  1  queues.  In  particu¬ 
lar,  these  results  imply  that  a  node-by-node  analysis  of  waiting  times  depending  as  they  do  on 
the  "insiders"  view  is  not  valid  if  one  simply  uses  M/ M/\  results  at  each  server.  Takacs  [10] 
studies  the  waiting  time  problems  in  the  system  discussed  in  this  paper.  Disney  [6]  presents 
another  view  of  the  same  problem. 
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AN  INVENTORY  MODEL  WITH  SEARCH 
FOR  BEST  ORDERING  PRICE* 

Kamal  Golabi 
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San  Francisco,  California 

ABSTRACT 

This  paper  presents  a  single-item  inventor)  model  with  deterministic 
demand  where  the  buyer  is  allowed  to  search  tor  the  most  favorable  price  be¬ 
fore  deciding  on  the  order  quantity  In  the  beginning  of  each  period,  a  sequen¬ 
tial  random  sample  can  be  taken  from  a  known  distribution  and  there  is  a  fixed 
cost  per  search.  The  decision  maker  is  faced  with  the  task  of  deciding  when  to 
initiate  and  when  to  slop  the  search  process,  as  well  as  determining  the  optima! 
order  quantity  once  the  search  process  is  terminated  l  he  objective  is  to 
minimize  total  expected  costs  while  satisfying  all  demands  on  time  We 
demonstrate  that  a  set  of  critical  numbers  determine  the  optimal  stopping  and 
ordering  strategies.  We  present  recursive  expressions  yielding  the  critical 
numbers,  as  well  as  the  minimal  expected  cost  from  the  beginning  of  even 
period  to  the  end  of  the  horizon 


I.  INTRODUCTION 

This  research  is  an  attempt  to  marry  some  aspects  of  search  theory  and  optimal  stopping 
with  inventory  theory.  Following  the  pioneering  work  of  Stigler  (111.  [I2|.  searching  for  the 
lowest  price  is  considered  a  basic  feature  of  economic  markets.  By  citing  examples  based  on 
real  data,  Stigler  [111  asserted  that  prices  change  with  varying  frequency  in  all  markets,  and 
unless  a  market  is  completely  centralized,  the  buyer  will  not  know  for  certain  the  prices  that  the 
various  sellers  quote  at  any  given  lime.  This  suggests  that  at  any  time  there  will  he  a  frequency 
distribution  of  the  prices  quoted  by  sellers  If  the  dispersion  of  price  quotations  by  sellers  is 
large  compared  to  the  cost  of  search,  it  will  pay— on  average  — to  obtain  price  quotations  from 
several  sellers  before  taking  an  "action"  The  vast  literature  on  search  theory  (a  survey  of 
which  can  be  found  in  Lippman  and  McCall  [8 1 .  l)e(iroot  [51.  and  Rothschild  1 1 0[ )  is  con¬ 
cerned  with  rules  that  the  searchers  should  follow  when  the  "action"  is  accepting  or  rejecting  a 
price  Once  the  price  has  been  accepted,  the  decision  process  terminates  In  many  dynamic 
models,  the  action  is  more  complicated  In  inventory  models,  lor  example,  the  decision  not 
only  involves  accepting  or  rejecting  an  ordering  price  hut  how  much  to  order,  an  action  which 
will  affect  the  search  and  ordering  policies  in  future  periods  In  this  paper  we  study  such  a 
model  We  seek  the  best  search  and  ordering  policies  lor  a  simple  dvnanuc  inventory  problem 
with  deterministic  demands  where,  in  the  beginning  ol  each  period,  the  purchaser  can  search 
for  the  lowest  price  before  placing  an  order 

*  I  his  rt'MMfth  w.is  jviUmIIv  MJp|*'»ti*«l  h\  the  Vili'-ntl  Vn*n>-  I  .  •unvl.iii.n  thfuigh  (  it.tni  NS|  |  N(i"*4  I  t4‘M  .irul  the 
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Classical  optimal  search  considers  the  following  problem:  A  purchaser  can  take  a  sequen¬ 
tial  random  sample  A'|,AN.  ...  from  a  continuous  distribution  with  a  known  distribution  func¬ 
tion  F.  There  is  a  fixed  cost  s  per  observation.  Suppose  that  if  the  decision  maker  stops  the 

sampling  (search)  process  after  the  values  .V,  =  .v,.  A':  =  v: . V„  =  ,v„  have  been  observed. 

his  cost  is  ,v„  +  sn.  Hence,  the  problem  is  to  find  a  stopping  rule  which  minimizes  /.  ( A\  +  s.\) 
where  ,V  indicates  the  random  number  of  observations  that  are  taken  under  a  specified  slopping 
rule.  It  can  be  shown  that,  whether  sampling  is  with  or  without  recall,  the  optimal  stopping 
rule  is  characterized  by  a  unique  critical  number  v*  (usually  called  the  reservation  price)  so  that 
an  optimal  sampling  rule  is  to  continue  sampling  whenever  an  observed  value  exceeds  v*  and  to 
stop  the  process  as  soon  as  some  observed  v  alue  does  not  exceed  v*  Vat  it  us  v  ersions  of  this 
problem  have  been  studied  by  MacQueen  and  Miller  !1>I.  Herman  and  Sacks  |h|  and  (  how  and 
Robbins  [21.  13]  among  others. 

The  above  search  model  can  be  visualized  as  a  one  period  purchasing  problem  in  which 
one  unit  of  some  commodity  has  to  be  purchased  at  the  beginning  of  the  period  Now  consider 
a  dynamic  multiperiod  version  of  this  problem  where  a  demand  of  one  unit  lias  to  be  satisfied 
in  iw/i  period  and  inventory  holding  cost  is  charged  for  items  held  over  for  use  in  subsequent 
periods.  As  in  the  classical  search  problem,  in  the  beginning  of  each  period  a  sequential  ran¬ 
dom  sample  A’|,.Y:,  ...  can  be  taken  from  a  distribution  with  known  distribution  function  /. 
but  the  decision  process  is  not  terminated  as  soon  as  an  acceptable  value  is  observed  I  be  deci¬ 
sion  maker  is  faced  with  the  task  of  deciding  how  much  to  order  so  as  to  minimize  total 
expected  costs  while  satisfying  all  demands  on  time.  When  the  inventory  level  is  sulVicient  to 
satisfy  the  immediate  demand,  he  has  also  the  burden  of  deciding  whether  to  initiate  search  at 
all  l  lus  multiperiod  rhodel  is  the  subject  of  our  study  in  tin-,  p.ipvi 

In  Section  2.  we  present  the  model.  In  Section  3.  we  give  the  optimal  search  policy  ami 
in  Section  4.  the  optimal  ordering  policy.  We  show  the  intuitive  result  that  an  optimal  strategy 
prescribes  that  search  should  he  initiated  only  when  the  inventory  level  is  zero  I  urthermorv. 
we  show  that  the  reservation  price  property  of  the  classical  search  problem  soil  holds  lh.it  is 
when  the  inventory  level  is  zero  (anil  therefore  search  has  to  be  initialed >  and  n  periods  remain 
to  the  end  of  the  problem,  there  exists  a  reservation  price  <>,  such  that  a  price  should  be 
accepted  if  it  does  not  exceed  o„  and  rejected  otherwise  In  Section  4.  we  show  that  once  a 
price  has  been  accepted,  a  finite  number  of  critical  numbers  specif v  the  optimal  strategy  I  he 
critical  numbers  divide  the  interval  [().<»,,]  into  segments  so  that  the  interval  in  which  the 
accepted  price  falls  determines  the  optimal  order  quantity  We  give  recursive  expressions  which 
yield  n„  as  well  as  the  minimal  expected  cost  for  any  period  to  the  end  of  the  horizon  We  will 
also  obtain  expressions  describing  the  critical  numbers  when  the  holding  cost  function  is  con¬ 
vex 

2.  THE  MODEL 

Consider  a  multi-period  single-item  inventory  model  in  which  a  demand  of  one  unit  has 
to  be  satisfied  in  the  beginning  of  each  period  and  an  inventory  holding  cost  is  charged  In 
each  period,  a  sequential  random  sample  Y|.Y-.  of  ordering  prices  can  he  generated  from  a 
continuous  distribution  with  known  cumulative  distribution  I  unction  /  (  l.  /  <  I .  i  <  >«'.  and  the 
A  s  are  mutually  independent.  I  he  cost  ol  generating  each  random  price  is  s  and  there  is  no 
limn  on  the  number  of  observations  which  can  be  made  in  each  period  After  receiving  a  price, 
the  decision  maker  has  to  decide  whether  to  accept  that  price  oi  generate  anothei  otler  It  he 
accepts  the  offered  price,  he  is  laced  with  the  decision  of  how  much  lo  order  AN  hen  the  inven¬ 
tory  level  is  sufficient  to  satisfy  the  immediate  demand,  he  also  has  to  decide  whethei  to  initiate 
search  at  all  I  he  objective  is  to  minimize  the  total  expected  costs 
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We  assume  that  the  length  N  of  the  planning  horizon  is  finite,  initial  inventory  is  zero, 
backlogging  of  demand  is  not  allowed,  the  cost  of  holding  c  units  for  one  period,  /;(;).  is  non¬ 
decreasing  in  r  and  /;(0)  =  0,  the  purchasing  cost  is  linear  in  the  quantity  ordered,  and  only 
integer  quantities  can  be  ordered.  We  also  assume  prices  that  are  not  accepted  immediately  are 
lost;  in  view  of  our  results,  sampling  with  recall  (of  prices  in  the  same  period)  extends  no  addi¬ 
tional  advantage  over  sampling  without  recall,  and  hence  would  not  affect  the  search  policy. 
Note  that  when  N  =  1,  this  model  reduces  to  the  classical  search  problem. 

Let  n  be  the  number  of  periods  remaining  until  the  end  of  the  horizon,  r  the  inventory  on 
hand  with  n  periods  remaining  and  .v  the  last  price  received.  In  each  period,  our  state  space 
consists  of  numbers  (r)  and  pairs  (r..v)  corresponding  respectively  to  the  state  of  the  system 
before  a  search  is  placed  and  the  state  when  a  search  has  been  placed  and  an  offer  v  has  been 
received.  A  policy  for  period  n  prescribes  a  search  decision  for  stale  ( r ) ,  and  a  reject-aecept 
and  ordering  decision  for  state  (r„v).  We  assume  that  for  each  period  an  optimal  policy  exists. 
Moreover,  we  restrict  our  attention  to  history-independent  policies;  that  is,  once  the  price  ,v  has 
been  rejected,  we  are  in  the  same  position  as  having  not  placed  the  search  at  all  Schematically, 
remembering  that  demand  in  each  period  equals  one,  the  period-state  pairs  correspond  to  each 
other  as  follows: 

l'or  r  ^  l : 


s'  Reject 

(fj  q)  Search  t  ,N  n  ,  Accept  v  and  order  an  amount  a 

3.  OPTIMAL  SEARCH  POLICY 

In  this  section,  we  present  the  optimal  search  policy.  We  show  that  search  should  only  be 
initiated  when  the  inventory  level  is  zero,  and  prove  that  in  each  period  a  single  reservation 
price  determines  the  stopping  rule.  We  also  give  a  recursive  expression  which  describes  the 
sequence  of  reservation  prices. 

To  being,  define 

l„(:..v)  =  the  minimal  (conditional)  expected  cost  during  the  last  n  periods  when  the 
inventory  level  with  n  periods  remaining  is  c  and  the  last  price  offered  is  x 

v,,(r)  =  the  minimal  expected  cost  during  the  Iasi  n  periods  before  ihe  decision  to 

search  for  an  offer  is  made  and  when  the  inventory  level  with  n  periods 
remaining  is  r. 
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un(z)  =  the  minimal  expected  cost  during  the  last  n  periods  after  the  decision  to 
search  for  an  ofTer  in  this  period  has  been  made,  and  when  the  inventory 
level  with  n  periods  remaining  is  r. 

H'„(r)  =  the  minimal  expected  cost  during  the  last  n  periods  after  the  decision  not  to 

search  for  an  offer  in  this  period  has  been  made,  and  when  the  inventory 
level  with  n  periods  remaining  is  z.  z  ^  1 . 

H(z)  -  the  total  holding  cost  of  c  units  to  be  used  in  r  consecutive  periods. 

Hence,  we  will  have  the  following  relationships: 

(1)  v„(j)  =  min[o„(r),  h„ (^ )] 

(2)  fy.fz.x)  =  min(v„(r),  min  (a.v  +  li(:  +  a  -  1)  +  v„. +  a  -  Dll. 

at  II.  2..  .n  -  :\ 

(3)  u„(z)  =  s  +  F.x  l»;(.-..v)l, 

(4)  u„(:)  =  /if:  -  1)  +  v„  -  1).  r  ^  I. 

and 

(5)  H(z)  =  £  li(z  -  i)  =  £  /;</). 

<-i  /-i 

Define 

(6a)  /„[.v,o]  =  ax  +  It  (a  -  I)  +  v„  |(o  -  1) 

and 

(6b)  /„(.v)  =  min  /„  (x.a ). 

ui  11.2 — /i! 

and  let  o„  (a  )  be  the  minimizing  value  of  a  in  (6a).  that  is. 

(6c)  /„(.v)  =  /„|x,o„(x)l. 

The  quantity  /„(.v)  is  the  minimal  expected  cost  attainable  during  the  last  n  periods  when  the 
inventory  level  with  n  periods  remaining  is  zero  and  it  has  been  decided  to  accept  the  last 
price  offered. 

At  this  point  it  is  natural  to  ask  whether  when  n  periods  remain,  there  exists  a  single  criti¬ 
cal  price  a„  which  dictates  the  acceptance  or  rejection  of  a  price  ,v  when  the  inventory  level  is 
zero.  In  other  words,  is  there  an  a„  such  that  it  is  optimal  to  accept  the  price  v  (and  order  a 
positive  amount)  if  x  <  «„  and  to  continue  the  search  if  v  >  That  this  is  indeed  the  case, 
is  verified  in  Theorem  I 

Define 

(7)  1  [v„<0)|. 

and  the  sequence  (.4„)„'-i>  by  the  following  recursion 

(8a)  -4|)  =  0 

and 

.4„F(n„)  =  v  +  f  min  (ox  +  //(a)  +  A„  ,  1  <//■'(. v)  for  n  ^  1. 

1  ...ii’ 


(8b) 
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We  will  show  later  that  a,  exists  and  that  A„  equals  v„(0).  so  that  a„  =  l;;HA„).  These  pro¬ 
perties  are  exploited  to  verify  that  an  optimal  policy  prescribes  that  search  be  initiated,  and  ord¬ 
ers  be  placed,  only  when  the  inventory  level  is  zero.  Furthermore,  we  will  show  that  if  the  set 
of  prices  at  which  it  is  optimal  to  order  one  unit  is  nonempty.  a„  =  A„  —  A„ _ ,  so  that  Equation 
(8b)  can  be  written  as 


(9) 


A„F(A„  —  A„  |)  =  s  +  I  min  lax  +  H(a)  +  A  ]dFL\). 

*/0  ai  11.2.  nl 


enabling  us  to  obtain  the  minimal  expected  cost  from  the  beginning  of  any  period  to  the  end  of 
the  horizon  by  finding  |/4,),-ik  the  unique  set  of  solutions  to  Equation  (9). 


THEOREM  1:  If  the,  inventory  level  with  n  periods  remaining  is  zero,  it  is  optimal  to 
continue  the  search  if  x,  the  last  price  offered,  is  greater  than  «„  and  accept  the  price  if 
.v  <  a„,  where  n  =  1,2 . N. 


PROOF:  Clearly,  / „(x.a )  is  continuous  in  .v  for  each  n  and  a ,  and  therefore,  /„(.v)  is  a 
continuous  function  of  x.  Furthermore,  for  all  positive  numbers  e, 

/„(.  V  +  «)  =  l„[x  +  €.  «„(.V  -+-«)!>  /„  I.Y.6J,,  (.v  +  t )]  >  /„  [.v,«„  (,v )  1  =  /„(.v), 

and  hence  /„(x)  is  strictly  increasing  in  x  let  be  such  that  /„I«„(.r)l  =  »,  i.e.. 

<«„(y)  =  /„  '(>').  Since 

v„(0)  >  v„_ i (0)  ^  min  l /; ( «  -  1)  +  v„  ,(u  -  1))  =  /„(0). 

ill  11.2.  .«! 

it  follows  that  <*„  =  «„|v„(0)]  exists  and,  as  /„< a  )  is  strictly  increasing  in  v.  it  is  unique  (see 
Figure  1).  The  first  inequality  of  the  above  expression  follows  from  the  fact  that  for  the  n  -  1 
period  problem  we  can  always  follow  the  optimal  policy  for  the  n  period  problem,  so  that  at 
each  stage  m.  n  -  1  >  m  >  1.  we  would  adopt  the  action  prescribed  by  the  n  period  optimal 
policy  for  stage  m  +  1.  Thus,  the  expected  cost  for  the  n  -  1  period  problem  under  this  pol¬ 
icy,  w'n.  !  (0) .  would  be  equal  to  the  expected  cost  of  the  first  n  -  1  periods  of  the  n  period  prob¬ 
lem,  and  hence  v/,  |(0)  ^  v„(0).  Since  v„_ !  (0)  <  v,'_ ,  (0) ,  it  follows  that  v„ (0)  is  nondecreas¬ 
ing  in  n. 
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From  (2)  and  (6)  we  have 

F„(0,x)  —  min  (v„(0),  /„(x)]. 

If  x  <  then  /„(x)  <  v„(0)  so  that  L„(0,x)  =  /„(x)  and  search  terminates.  If  x  >  a„, 
then  In(x)  >  v„(0)  so  that  T„(0,x)  -  v„(0),  in  which  case  it  is  optimal  to  continue  the  search. 

Q.E.D 

Thus,  when  the  inventory  level  is  zero,  a  single  critical  number  determines  whether  a 
price  should  be  accepted.  We  are  also  interested  in  finding  the  optimal  strategy  when  the 
inventory  level  is  positive.  It  seems  intuitive  that  if  the  immediate  demand  can  be  satisfied  by 
the  current  inventory,  it  would  be  best  to  postpone  the  search— since  it  is  possible  to  incur  the 
same  amount  of  expected  search  cost  in  a  later  period  while  saving  on  the  holding  cost.  The 
next  result,  the  proof  of  which  is  given  in  the  Appendix,  verifies  this  observation.  In  addition, 
it  shows  that  the  expected  cost  from  any  period  k  in  which  the  inventory  level  is  zero  to  the 
end  of  the  horizon  equals  Ak.  Thus,  the  expected  cost  from  any  period  can  be  determined  by 
computing  the  sequence  {zl„)  from  Equation  (8b). 

THEOREM  2.  Under  the  assumptions  of  the  model,  for  all  k ,  A:  =  1,2 . N, 

(a)  vA  (0)  =  Ak 

(b)  vA.(z)  =  H(z)  +  vA_.(0)  for  1  <  z  <  k. 

Theorem  2  verifies  that  search  should  be  initiated  only  when  the  inventory  level  is  zero, 
and  Theorem  1  gives  a  rule  for  accepting  or  rejecting  an  offered  price  once  search  is  initiated. 
These  two  results  however,  do  not  completely  specify  the  optimal  strategy.  Given  that  an 
acceptable  price  is  received,  we  would  like  to  know  how  much  should  be  purchased  at  that 
price.  This  question  is  investigated  in  the  next  section. 

4.  OPTIMAL  ORDERING  POLICY 

In  this  section  we  present  the  optimal  ordering  policy  once  an  acceptable  price  has  been 
received.  In  Corollary  3  we  show  that  a  nonincreasing  sequence  of  critical  numbers  characterize 
the  optimal  order  quantity.  In  other  words,  once  a  price  is  received  that  is  less  than  the  reser¬ 
vation  price  for  that  period,  the  interval  in  which  the  offered  price  fails  determines  the  quantity 
that  should  be  ordered  at  that  price.  In  Theorem  5  we  obtain  expressions  which  describe  these 
critical  numbers  when  the  holding  cost  function  is  convex. 

Before  presenting  the  next  result  we  note  that  when  n  periods  remain,  the  inventory  level 
is  zero,  and  an  acceptable  price  has  been  received,  the  optimal  order  quantity  is  equal  to  a„(x). 
To  see  this,  note  that 

I'„(0.x)  =  min[(v„(0),  /„ (x)| 

by  (2)  and  (6).  This  fact  coupled  with  Theorem  l  yields  Ln(0,x)  -  /„(x)  whenever  x  < 
Finally,  since 

(10)  /„<x>  *  /„ |x,a„ (x )  1  *  min  [ax  +  //(a  —  1)  +  v„  tla  ~  1)J, 

1  ^  a  ^  n 

it  follows  that  ordering  a„(x)  minimizes  the  expected  cost  attainable  during  the  last  n  periods 
when  the  inventory  level  is  zero  and  x  ^  a„.  Note  also  that  by  Theorem  2(b),  Equation  (10) 
can  be  written  as 
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(11) 


/„(.\)  =  min  [a.v  4-  H(a)  4  A„-a  1. 

1  <  a  ^  n 


COROLLARY  3:  If  n  periods  remain,  the  inventory  level  is  zero,  and  an  acceptable  price 
has  been  received,  then  the  optimal  order  quantity  is  nonincreasing  in  the  price  offered,  i.e.. 

a„(x')  <  a„(x)  whenever  x'  >  x,  n  =  1,2 . N.  Consequently,  a  nonincreasing  sequence 

of  critical  numbers  {#,(«)), 'L|  characterize  the  optimal  order  quantity.  Specifically,  it  is  optimal 
to  order  k  units  whenever  Bk  (// )  <  ,v  <  Bk  _ ,  in ). 

PROOF:  From  (6c)  and  (11),  we  have 

l„(x)  =  /„[.\.«„(.v)l  =  a„  (,v )  v  4  //|u„(.v)l  4  A„  „  (>l 
<  /„l.v,o„(.\')l  =  a„ix‘)  v  4  tl[an(x')]  4  A„.  i(_ 

giving 

(12)  .vla„(.v')  -  u„(.v)]  ^  An.u  ,,,  -  4  //|a„(.v))  -  //|u„(.v')). 

If  a„(x')  >  a„(x).  then  (12)  implies 

.v'(fl„(.v')  -  a„ (x ) ]  >  -  A„  „:(x.|  4  //[a„(.v)|  -  //[a„(.v')l. 

which  yields 

/„<-v  )  =  a„(.v')  ■  ,v'  4  A„  <7  , v-,  4  //Ia„(.v')l  >  a„(x)  ■  x'  4 
4  //(a„(.v))  =  /„ l.v'.a,, (.v )), 

contradicting  the  fact  that  a„(.v')  is  optimal  when  .v'  is  offered.  Q.K.l). 

Intuitively,  we  would  expect  that  when  an  offered  price  equals  the  critical  number  <»„.  we 
would  be  indifferent  between  ordering  one  unit  and  not  ordering  at  all.  If  this  were  indeed  the 
case,  the  expected  cost  when  the  price  is  rejected,  v„(0).  would  be  equal  to  «„  4  v„  , (0)  yield¬ 
ing  «„  =  A„  -  A„- 1.  This  result  could  then  be  used  to  obtain  a  simple  expression  for  the 
Bk(n)’s  when  /;(•)  is  convex.  As  we  will  show  in  Lemma  4,  the  above  result  holds  if  the  set  of 
prices  at  which  it  is  optimal  to  order  one  unit  is  nonempty.  Unfortunately,  as  seen  from  the 
following  example,  this  is  not  always  the  case. 

EXAMPLE  I.  Let  n  =  5,  s  =  5.  hi:)  =  0  for  all  r  and  the  price  distribution  be  such  that 

PiX  =  2)  =  1  —  e ,  and  Pia  ^  X  ^  h)  =  ~  ib  -  a)  for  0  ^  a  <  b  ^  4.  where  2  is 

4 

excluded  from  all  intervals  and  «  is  an  arbitrary  small  number.  Suppose  the  offered  price  in  the 
beginning  of  the  fifth  period  is  3. 

The  expected  cost  of  rejecting  the  offered  price  is  (approximately) 

5  4  2  x  5  =15, 

as  one  would  pay  the  search  cost  of  5  and  almost  definitely  receive  the  price  of  2.  at  which  one 
would  order  5  units  However,  the  expected  cost  of  ordering  /  units.  /  ^  4.  is  (approximately) 

3/  4  5  4  2(5  -  /)  =  15  4  /. 


while  the  cost  of  ordering  5  units  is  15  Hence,  we  would  be  indifferent  between  not  ordering 
and  ordering  at  v  =  3.  which  implies  that  m  =  3. 
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Since  at  x  =  3  we  order  5  units,  any  price  above  3  is  rejected,  and  the  optimal  order  quan¬ 
tity  a„(x)  is  nonincreasing  in  x,  it  follows  that  lx  \an(x)  =  1)  is  empty. 

LEMMA  4:  If  (x  .a„(x)  =  1]  is  nonempty,  thena„  =  A„  -  A„-\. 

PROOF:  Let  x  be  the  largest  x  such  that  a„(x)  =  1.  By  Theorem  1,  «„  is  the  highest 
price  at  which  it  is  optimal  to  order  a  positive  quantity.  Therefore,  x  <  a„.  Consequently,  we 
can  conclude  from  Corollary  3  that  a„(a„)  <  1,  but  a„(a„)  is  positive  so  that  an(an)  =  1. 
From  Theorem  2  and  Equations  (7)  and  (11),  we  have 

A„  =  v„(0)  =  /„(<*„ )  =  min  1  aa„  +  H(a)  +  A„~a  1 

n 

=  iani,an)  ■  a„  +  //la„(«„))  +  *a„  +  H(l)  +  A„- 

which  yields  a„  —  A„  ~  Q. E  D. 


Whereas  we  cannot  determine  in  advance  the  conditions  under  which  Lemma  4  would 
hold,  we  can  proceed  by  assuming  that  the  lemma  holds,  and  deterrnine  the  sequence 
that  satisfies  Equation  (9).  We  then  can  obtain  |a„}  from  an  =  1^X(A„).  If  |a„)  and  [A„]  also 
satisfy  Equation  (8b),  by  uniqueness  of  the  solution,  a„  is  indeed  equal  to  A„  -  A„-,. 

It  is  interesting  to  note  that  contrary  to  what  one  might  expect,  a„  is  not  monotone  in  n. 
Before  Theorem  5,  we  give  examples  where  «„  is  not  monotone  irrespective  of  whether 
lx  :  an  (x )  -  1 1  is  empty  or  not. 

EXAMPLE  2:  (a)  Consider  again  Example  1.  Since  we  would  almost  definitely  receive 
the  price  of  2  after  the  first  search,  we  have 

v„ (0)  =  s  +  min  lux  +  H(a)  +  v,,„u(0)]. 

1  ^  a  ^  n 

Thus. 

v,(0)  =5  +  2=7 

v2(0)  =  5  +  min  (2  +  7,4)  =  9. 

From  v„(0)  =  /„(«„),  we  have  « t  =  vt (0)  =  7  and 
9  =  min  («2  +  7,  2  a2) 

yielding  «2  =  4.5.  As  shown  earlier,  <»n  =  3.  Therefore,  a„  is  not  monotone  in  n. 

(b)  We  note  that  «„  is  not  necessarily  monotone  even  if  lx:«„(x)  —  1]  is  nonempty.  Con¬ 
sider  the  case  where  the  price  distribution  is  the  same  as  Example  1  However,  there  is  a  hold¬ 
ing  cost  of  1  per  unit  per  period  and  s  =  2  Then.  //( 11  =  0,  //(2)  =  1.  // (3)  =  3  and 
7/(4)  =  6  and 

v,(0>  =2  +  2=4 

v2(0)  =  2  +  min  (4  +  1,2  +  4]=  7 

v,(0)  =  2  +  min  (6  +  3, 4  +  1  +  4,  2  +  7]  =  1 1 

v4(0)  =  2  +  min  (8  +  6,  6  +  3  +  4,  4  +  1  +  7,  2  +  1 1 )  =  14. 
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From  v„(0)  =  /„(<*„),  we  have 
4  =  a  | 

7  =  min  [2 aj  +  l,aj  +  4] 

1 1  =  min  [3a3  +  3, 2 aj  +  1  +  4, 0:3  +  7] 

14  =  min  [4a4  +  6,  3a4  +3  +  4,  2a4  +1  +  7, 04 +11] 

yielding 

a i  =  4,  a;  =  3,  <23  =  4,  a 4  =  3. 

Note  that  in  this  example,  a„(a„)  =  1  for  1  <  n  <  4  and  the  condition  for  Lemma  4  holds.  It 
can  be  easily  verified  that  an  =  v„(0)  -  v„_t(0)  for  all  1  <  n  <  4. 

THEOREM  5:  If  the  condition  for  Lemma  4  holds  and  if  the  holding  cost  function  h(  ) 
is  convex,  then 

(13)  Bk(n)  =  a„_A  —  h(k),  where  1  <  k  $  n. 

PROOF:  We  have  to  show  that 

(a)  The  RHS  of  (13)  is  nonincreasing  in  k. 

(b)  It  is  optimal  to  order  k  units  if  x,  the  price  offered,  satisfies 

(14)  an-k  -  h(k)  <  x  <  a,.(l_n  -  h(k  -  1). 

To  show  (a),  we  note  that 

A„-k  +  \  =  =  4,-*  +  ]  («„_*  +  ! )  =  ,  +  l 

\^a^n-k  + 1 

+  H(a)  +  A„-k  +  i_a]  <  2an-k+\  +  H( 2)  +  A„-k- 1 
“  2(/f„_*+i  -  <4„-*)  +  * ( 1 )  +  An_k~  1 , 

where  the  first  equality  follows  from  Theorem  2,  the  second  from  (7),  the  third  from  (11)  and 
the  last  from  Lemma  4.  Thus,  by  convexity  of  /;(•), 

h(k)  -  h(k  -  1)  >  Ml)  -  A„.k.x  -  An.k  +  l  + 

=  an-k  —  an-k  + 1 

and,  therefore,  (a)  is  true. 

To  show  (b),  suppose  x  is  such  that  (14)  holds.  We  show  that  ln(x,k  —  j)  < 
l„(x,k  -  j  -  1)  for  each  j  >  0,  and  therefore  ordering  k  units  is  at  least  as  good  as  ordering 
any  amount  less  than  k.  Suppose  ln(x,k  -  j)  >  I„(x,k  -  j  -  1).  Then 

(k  -  j  -  1  )x  +  +  H(k  -  j  -  1)  <  (k  -  j)x  +  An-(k-j)  +  H (k  -  j) 

which  yields 

x  >  A„^(k_j)  /t()c  y  1) 

=  a„-(k-j-\)  -  h(k  -  j  -  1)  >  a„-(*-n  -  h(k  -  \), 

where  the  last  inequality  follows  from  (a).  This  contradicts  the  right  inequality  of  (14).  There¬ 
fore,  ln(x,k  —  y)  <  /„(x,£  -  j  -  1). 
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I.A.x.k  +  / )  $  /,,  (  a. A  +  /  4  1)  lor  each  j  >  0  hy  a  similar  proof, 
lienee,  it  is  optimal  to  order  A  units  whenever  (14)  holds.  Q  I  I) 

5.  REMARKS 

The  purpose  of  this  study  has  been  to  investigate  optimal  search  policies  in  the  context  of 
a  sequential  model.  The  underlying  inventory  model  has  been  chosen  as  a  rather  simple  one. 
There  are  no  setup  costs  involved  and  the  demand  equals  one  unit  in  each  period.  It  would  be 
interesting  to  investigate  more  general  problems  We  suspect  that  both  the  reservation  price 
property  of  Theorem  1  and  the  Wagner-Whitin  113)  type  result  of  Theorem  2  (order  only  when 
current  inventory  level  is  zero)  would  still  hold  for  models  with  setup  costs  and  arbitrary  deter¬ 
ministic  demands.  The  optimal  policy  would  be  a  function  of  setup  costs  as  well  as  the  holding 
cost  and  price  distribution,  fhe  results  should  also  hold  when  the  price  distributions  are  non- 
staiionarv  Given  that  the  initial  inventory  is  /ero,  the  ordering  policy  will  be  such  that  there  is 
no  inventory  in  the  beginning  of  periods  with  favorable  price  distributions 

Another  interesting  extension  is  the  case  wherein  the  search  process  is  adaptive  I  he 
searcher  does  not  know  the  exact  distribution  of  price:  the  price  oiler  is  used  not  only  as  an 
opportunity  to  order  at  that  price  but  also  as  a  piece  of  information  to  update  the  prior  distribu¬ 
tion.  When  the  distribution  of  prices  is  not  known  exactly,  the  form  of  the  optimal  policy  is 
not  obvious.  As  Rothschild  [IOl  points  out.  the  reservation  price  property  of  Theorem  1  would 
not  necessarily  hold  even  for  a  one  period  problem.  Rothschild  presents  the  following  example 
Suppose  there  are  three  prices.  SI.  S2.  and  S3,  and  that  the  cost  of  search  is  SO.OI.  Prior 
beliefs  admit  the  possibility  of  only  two  distributions  of  prices.  Either  all  prices  are  S3  or  thev 
are  distributed  between  SI  and  S2  in  the  proportions  99  to  I  A  man  with  these  beliefs  should 
accept  a  price  of  S3  (as  this  is  a  signal  that  no  lower  prices  are  to  be  had)  and  reject  a  quote  of 
S2  (which  indicates  that  the  likelihood  that  a  much  better  price  will  be  observed  on  another 
draw  is  high). 

However,  when  the  distribution  is  a  member  of  certain  families  of  distributions  hut  has 
one  or  more  unknown  parameters,  Rothschild  [IOl,  DeCiroot  151  and  Albright  111  have  shown 
that  the  reservation  price  property  holds  for  the  one-period  problem.  We  conjecture  that  when 
the  distribution  of  price  is  stationary  but  is  not  known  exactly,  search  should  be  initiated  only 
when  the  inventory  level  is  zero.  If  this  is  the  case  and  the  distribution  belongs  to  one  of  the 
families  of  distributions  studied  hy  Rothschild  l!()|  and  Albright  fli.  then  the  reservation  price 
property  as  well  as  the  ordering  policy  presented  in  Section  4  should  still  hold. 
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APPENDIX 

THEOREM  2:  Linder  the  assumptions  of  the  model,  for  all  k,  k  —  1.2,  ....  /V. 

(a)  V;  (0)  -  Ak 

(b)  v* (r )  =  H(:)  +  v*  ..(0)  for  I  <  r  <  k. 

Consequently,  the  search  process  is  initiated  only  when  the  inventory  level  is  zero. 

Before  proving  Theorem  2,  we  establish  two  elementary  facts. 

LEMMA  A:  For  any  two  positive  integers  land  j,  H(i  +  /)  >  H(i)  +  H(j). 

PROOF: 

i * /  i  i  i  i*i  i  <  •  i  ,i 

//(/+./)=  £  /»(*)-£  M*)  +  £  //(A)  >  £  hlk)  +  £  h(k) 

/.-l  *-l  k-i  n-i  *-i 

=  HU)  !  H(j).  Q  E  D 


~  r»  /f  t  t  ) 

LEMMA  B:  The  integral  |  1  v  —  /„(x)]  dP(x)  =  G'„(  v)  is  strictly  increasing  in  v. 

•/<) 

continuous,  and  unbounded  above. 

PROOF:  Since  /„[a„(y)]  -  y  and  /„(x)  is  strictly  increasing  in  ,v,  it  follows  that  <*„(>•)  is 
strictly  increasing  in  y.  Hence,  G„(.v)  is  strictly  increasing,  continuous  (as  Fis  continuous)  and 
unbounded  above.  c  i-> 
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PROOF  OF  THEOREM  2:  The  proof  is  by  induction  on  k.  From  Equations  (1),  (3),  (2) 
and  (6),  we  have 


(A-l ) 


v*(0)  -  uA(0)  -  s  +  £,(FA(0,x)) 

=  s  +  £  min  i Vx  (0) ,  min  lax  +  h(a  -  1)  +  v.  ,  ( a  —  0] 

1  I  ^  a  <  * 


=  s  4-  E  min  (vA(0),  /A(x)l. 

For  k  =  1,  (b)  is  obvious.  To  show  (a),  note  that  by  (6),  / |(x)  —  x.  Next,  from  (A-l) 
we  have 

J.V.(O)  —  oo 

xdF(x)  +  J  V|(0  )dF(x), 

from  which  we  obtain 

( A-2)  v,(0)  £[v,(0)J  =  s  +  f  xdF(x). 

(Note  the  close  connection  between  V|(0)  and  the  maximizing  price  in  the  house  selling  prob¬ 
lem.)  In  order  to  determine  whether  V|(0)  is  the  unique  solution  to  (A-2),  note  that  it  is 
equivalent  to  verify  that  s  =  j  ( v  —  x)  dF(x)  —  C >  (j^ )  has  a  unique  solution.  The  latter 

«r(l 

result  follows  from  Lemma  B. 

From  (7)  we  have  /|(a,)  =  v , (0)  and  therefore  a,  =  v , (0) .  Thus,  (A-2)  becomes 
<»i£(a,)  =  s  +  f  xdF(x), 

•J  0 

which  coupled  with  (8b)  for  n  -  I,  gives  A\  -  at|  —  v, (0)  so  that  (a)  holds  for  k  -  1. 


Assume  (a)  and  (b)  hold  for  k  =  1,2 . n  —  1.  We  show  that  the  theorem  holds  for 


A  =  n. 


From  (A-l),  we  have 
v„(0)  =  .v  +  E  min  (v„(0),  /„(*)) 

-  s  +  f  l„(x)  dFix)  +  v„(0 )dF(x) 

•'(I  */«i 


=  (£(«„))  1  s  +  f  I  min  (ax  +  h(a  -  I)  +  v„_,(a  -  l)]|</F(x)l 

)  | 

=  !/■  («, ,)(  'Is  +  f  f  min  [ax  +  h(a  -  I)  +  H(a  -  1)  +  v„  ,(0)]|  dF(x) 

( I  <,/<«  J 


=  (£(«„)] 


4-  ^  |  min  (ax  +  //(a)  +  A„  „]Jc/£(x) 


=  |f(tt„)l  1  A„  F(tt  „ ) 

=  A„. 

where  the  second  equality  follows  from  Theorem  1,  the  third  from  a  simple  rearrangement  of 
terms,  the  fourth  and  fifth  equalities  from  the  induction  hypothesis  and  the  sixth  equality  from 
(8b)  Therefore  (a)  is  true  for  A  -  n 
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Since  we  are  assuming  that  (b)  holds  for  k  —  n  -  1,  it  follows  that 
1 A  -  3 )  /„<v)  «=  min  [a.v  +  H(a)  +  A„  J 

I  \  JVM 

and 

.fi  ln.,1  -  i  +  f  /„(.v>  t/Flx). 
which  gives 

(A-4)  f  M„  -  /„(.v)l  (/Fix)  -  s. 

Jo 


We  note  i hat  by  (4)  and  (he  induction  hypoihesis,  for  1  ^  r  ^ 

»„(;)  =  /)(.--  1)  +  v„  -  1)  =  hi;  -  1)  +  Hlz  -  1)  +  v„  (0) 

=  H(z)  +  v„  .(0), 

and  therefore  to  prove  (b)  for  k  =  n.  it  suffices  to  show  v„(r)  =  H„(r)  whenever  r  ^  1.  That 
is.  we  need  lo  show  //„ ( r )  >  HI:)  +  v „  . ( 0 )  whenever  :  >  1. 


We  can  write 


u  (: )  =  v  +  min|v„(r ).  min  la.v  +  h  (:  +  a  -  11  +  v„  .  (r  +  a  -  I  )1 

j  '  K  ..  S  n  ■: 

=  s  +  /:'  min|//„(z).  M„(r).  ^  min  lav  +  III:  +  a  -  1)  +  v„  +  a  -  1)| 


=  s  +  /:'  min 


=  s  +  E  mm 


^  s  +  I.  min 


=  •>  +  /-.  min 


=  s  +  l.  min 


t/„( r).  min  la.v  4-  hi:  +  a  —  I)  +  v„  ,(;  +  a  -  1)1 

os  „  ■  ..  . 


a.,(r).  min  [a.v  +  HI:  +  a)  +  A„ 

II*.  ,/  <  .i  ; 


ii, ,):) .//):)  +  min  la.v  +  //(a)  +  A„  J 

a*  .,'  n  ■ 


u„l:).Hl:)  +  min  1.4, ,  ,  min  (a.v  +  //(a)  +  A„  ,)]( 


«„(r).//(r)  +  min  |v„  (())./,,  ( a  ) ) 


where  the  first  equality  follows  from  (3)  and  (2).  the  second  from  (1).  the  fourth  from  induc¬ 
tion  hypoihesis  and  the  Iasi  equality  from  the  induction  hypothesis  and  (A-3).  The  inequality 
follows  from  Lemma  A  Hence, 


I  A  -  5 ) 


y 


ii  <:)  -  Hi:)  Ss 


s  +  F  mm 


u, .  ( r  1 


//(:).  mm  |v„  (0) ,/,  ( v  I) 


If  y  were  less  than  s.  (01.  then  from  ( A  -  5 )  we  would  have 
y  >  s  +  /-.  mm  (y  ,/„  (v)| 
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giving 

J"fl  [y  —  l„-.(x)\dF(x)  ^  s  -  "  \A„:  -  l„.:{x)]dF(x) , 

where  the  equality  follows  from  (A-4).  Hence, 

G„-:(y)  >  G„..(A,-:)  -  C„_,lv„_;(0)]. 

contradicting  Lemma  B.  Therefore,  y  >  v„_.(0),  which  completes  the  induction  argument. 

Q.E.D. 
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ABSTRACT 

Tins  pa  pc  i  provides  an  overview  ol  ihe  Computet-  Assisted  Seareli  Planning 
K  XSI’l  s\siem  developed  tor  i he  United  States  Coast  (iuard  The  (  \SP  m 
lot mation  ptoeessmj:  methodologv  is  based  upon  Monte  Carlo  simulation  io 
obtain  an  initial  prohabiluv  distribution  lor  target  location  and  to  update  ibis 
distribution  to  account  lor  drill  due  to  currents  and  winds  \  multiple  Meii.tiio 
approach  is  emploved  io  generate  the  initial  prohabiluv  distribution  Havesian 
updating  is  used  to  lellect  negative  mtoi illation  obtained  liotit  utisiuvesslul 
search  Ihe  principal  output  ol  ihe  (  \SP  svsiem  is  a  sequence  o|  piohahitiu 
"maps"  which  displav  the  currem  ureei  location  prohabiluv  disMihmions 
ihniuuhoul  the  time  period  ol  interest  (  \SP  also  ptovules  emdaiuc  lot  alio 
eating  seat cb  elloit  based  upon  optimal  search  ilieorv 


1.  INTRODUCTION 

This  paper  provides  an  overview  of  the  computer-assisted  search  planning  (C'ASP)  system 
developed  for  the  United  States  Coast  Guard  to  assist  its  search  and  rescue  (SAR)  operations 
The  system  resides  on  a  CDC  3300  located  in  Washington,  D  C.,  and  can  he  used  by  all  USCG 

Rescue  Coordination  Centers  (RCCs)  in  the  continental  United  States  and  Hawaii  via  remote 

access  terminals. 

The  Coast  Guard  is  engaged  daily  in  search  and  rescue  missions  which  range  from  simple 
to  complex.  The  amount  of  information  available  to  predict  the  position  of  the  search  target 
ranges  from  extremely  good  to  almost  no  information  at  all  I  he  process  of  planning,  com¬ 
manding,  and  evaluating  these  searches  takes  place  in  Rescue  Coordination  ('enters  (R(  (s* 
located  throughout  the  United  States  in  major  coastal  cities 

Ihe  entire  planning  process  begins  with  the  awareness  that  a  distress  on  the  water  m.o 
exist  I  his  awareness  usually  results  from  a  telephone  call  Irom  a  Iriciul  01  rol.it i \ c  m  (torn  .1 
radio  communication  from  the  boat  or  vessel  itself 

*  I  ‘'i-s  w < >i  k  w .is  n  jppi *r i ci I  in  p.tr  i  h\  \  S(  ( i  (  I  'm.u  l )•  » I  (  (  i  » ?4V»  \  m  I  i  i\K  <  •  ■  i  V  \i ion 1  j  uo  <  o  1  ;  - 

’  *  I  ”  .•  npiM  i>  "In  i  *1  insOI  'H  »Ds  *  mi:  ,|ilU'i|  hciuut  .It  f  !  pr  •  v  ,i  ■  i  ■ » -  *  v  i  I  '  •'  i  •  !  i '  *'  1  .  t  i  Ol.  « 

•'  mow  t»l  t‘v  (  nmm.iml.in:  m  (mud  ,u 
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Next  all  available  information  has  to  be  evaluated  to  decide  whether  or  not  to  begin  a 
search,  and  what  level  of  etTori  is  required  given  the  search  begins.  At  this  point  a  great  deal  of 
effort  goes  into  deciding  where  the  distress  incident  occurred.  This  might  be  considered  the 
first  phase  of  planning. 

The  next  phase  involves  computing  where  the  search  target  will  be  when  the  first  search 
units  arrive  on  scene.  Among  other  things,  this  requires  the  prediction  of  ocean  drift  and  wind 
velocity  and  the  estimation  of  uncertainties  in  these  predictions. 

The  next  questions  pertain  to  the  effort  allocation  process— how  much  effort  must  be 
expended  and  in  what  areas?  Prior  to  the  advent  of  computer  search  programs,  SAR  planners 
relied  upon  various  rules  of  thumb  as  presented  in  the  National  Search  and  Rescue  Manual 
III]  Simplicity  was  necessary  to  facilitate  hand  computation,  but  at  the  same  lime  prevented 
adequate  treatment  of  the  many  sources  of  uncertainty  which  characterize  a  SAR  incident. 

The  search  phase  is  the  actual  deployment  of  aircraft  and  vessels,  the  conduct  of  preset 
search  patterns,  and  the  report  of  results  back  to  the  RCC. 

If  the  search  is  unsuccessful  for  that  day,  then  the  results  must  be  reevaluated  and  a  new 
search  planned  for  the  following  day. 

This  process  continues  until  the  target  is  found  or  until  the  search  is  terminated.  In  brief 
(and  in  slightly  more  technical  terms),  the  planning  phases  are  as  follows: 

( I )  Determine  the  target  location  probability  distribution  at  the  time  of  the  distress 
incident 

<  2  *  1'pdaie  the  target  location  probability  distribution  to  account  for  target  motion  prior  to 
the  earliest  possible  arrival  of  a  search  unit  on-scene. 

(?)  Determine  the  optimal  allocation  of  search  effort,  and  estimate  the  expected  amount 
of  search  effort  required  to  find  the  target. 

< 4 )  Execute  (he  search 

(5)  If  the  search  is  unsuccessful,  evaluate  the  results  and  update  the  target  location  proba¬ 
bility  distribution  to  account  for  this  negative  information. 

(6)  Repeal  (he  planning  procedures  in  Steps  (2)  through  (5)  until  the  target  is  found  or 
the  search  is  terminated 

These  planning  phases  are  illustrated  in  the  CASP  case  example  given  in  Section  3. 

The  first  efforts  ai  computerization  concentrated  on  the  target  location  prediction  process 
( heanographic  models  were  used  to  compute  drift  and  to  estimate  target  position  The  Mon- 
terey  Search  Planning  Program  and  the  Coast  Guard's  own  Search  and  Rescue  Planning  Sys¬ 
tem  V\KP.  represented  early  computer  assisted  search  efforts,  liven  today,  in  cases  where  the 
information  available  makes  the  planning  straightforward,  the  SARP  program  does  nicely 


In  |97i),  the  Office  of  Research  and  Development  in  Washington  funded  development  of 
a  more  comprehensive  approach  to  search  planning  based  in  part  on  lessons  learned  in  the 
Mediterranean  H-bomb  search  in  1966  (Richardson  15))  and  in  the  Scorpion  search  in  1968 
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(Richardson  and  Slone  [6]).  In  1972.  the  CASP  system  was  delivered  to  the  Operation-, 
Analysis  Branch  of  Commander  Atlantic  Area  in  New  York  for  evaluation,  implementation, 
and  training.  The  system  was  made  operational  early  in  1974. 

CASP  is  now  in  use  in  11  Coast  Guard  rescue  centers.  In  addition.  CASP  has  been  used 
at  the  Air  Force  Central  Rescue  Headquarters  at  Scott  AFB,  Illinois,  to  help  plan  and  coordi¬ 
nate  search  missions  for  lost  airplanes  within  the  continental  United  States.  A  modification  of 
the  CASP  system  has  also  been  provided  to  the  Canadians  for  inland  SAR  planning. 

At  the  present  time,  the  use  of  CASP  is  limited  to  open  ocean  searches.  Even  though 
these  searches  represent  but  a  small  percentage  of  the  total  U  S.  Coast  Guard  search  operations. 
CASP  has  been  credited  with  saving  over  a  dozen  lives. 

Section  2  provides  a  description  of  the  CASP  methodology.  Section  3  illustrates  the  use 
of  CASP  in  an  actual  SAR  incident  involving  the  1976  sinking  of  the  sailing  vessel  S/V  Spirit  in 
the  Pacific,  and  Section  4  describes  CASP  training 

2.  CASP  METHODOLOGY 

The  CASP  information  processing  methodology  is  based  upon  Monte  Carlo  simulation  to 
obtain  an  initial  probability  distribution  for  target  location  and  to  update  this  distribution  to 
account  for  drift  due  to  currents  and  winds.  A  multiple  scenario  approach  is  employed  to  gen¬ 
erate  the  initial  probability  distribution  In  the  sense  used  here,  a  scenario  is  a  hypothetical 
description  of  the  distress  incident  which  provides  quantitative  inputs  for  the  CASP  programs 
Bayesian  updating  is  used  to  reflect  negative  information  obtained  from  unsuccessful  search 

The  principal  output  of  the  CASP  system  is  a  sequence  of  probability  "maps"  which 
display  the  current  target  location  probability  distributions  throughout  the  time  period  of 
interest  CASP  also  provides  guidance  for  allocating  search  effort  based  upon  optimal  search 
theory. 

The  CASP  system  is  composed  of  a  number  of  different  programs,  each  designed  for  a 
different  information  processing  function.  The  program  components  are  MAP.  POSITION. 
AREA,  TRACKLINE,  COMBINATION.  DRIFT.  RECTANGLE.  PATH,  and  MULTI;  the 
functions  are  as  follows: 

(1)  display  the  probability  maps  (MAP). 

(2)  generate  an  initial  distribution  of  target  location  at  the  time  of  distress  (POSITION. 
AREA.  TRACKLINE,  and  COMBINATION). 

(3)  update  the  target  location  probability  distributions  for  motion  subsequent  to  the  time 
of  distress  (DRIFT), 

(4)  update  the  target  location  probability  distributions  for  negative  search  results  and  com¬ 
pute  the  cumulative  detection  probability  (RECTANGLE  and  PATH),  and 

(5)  calculate  optimal  allocations  of  search  effort  (MAP  and  MULTI). 


These  programs  will  be  described  below  following  presentation  of  an  overview  of  the  general 
system  design 
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CASP  System  Design 

The  CASP  system  design  was  molivaied  by  a  desire  lo  provide  a  highly  realistic  probabilis¬ 
tic  description  for  the  target's  location  at  the  time  of  the  distress  incident  and  for  the  target's 
substantial  motion.  In  view  of  the  success  achieved  in  the  Mediterranean  H-bomb  search  [12] 
in  1966,  and  in  the  Scorpion  search  [5]  in  1968.  it  seemed  evident  that  a  Bayesian  approach 
would  provide  a  practical  method  for  incorporating  information  gained  from  unsuccessful 
search. 

Target  motion  modeling  posed  a  more  difficult  problem.  Models  which  were  amenable  to 
an  "analytic"  approach  were  not  flexible  enough  to  give  a  good  representation  of  the  search 
facts  For  example,  Gaussian  motion  processes  (or  mixtures  of  Gaussian  processes)  were  unsa¬ 
tisfactory  in  cases  where  the  search  facts  required  a  uniform  or  annular  shaped  target  location 
probability  density.  Markov  chains  based  on  transitions  among  search  grid  cells  were  unsatis¬ 
factory  in  cases  where  one  desired  to  change  the  grid  in  the  course  of  an  operation.  In  general, 
these  models  tended  to  force  the  facts  to  lit  the  mathematics  to  an  undesirable  extent. 

It  was  also  desired  to  develop  a  modular  system  so  that  additional  features  and  improve¬ 
ments  could  be  made  as  time  went  on.  In  order  to  gain  the  confidence  of  the  users,  the  system 
had  to  be  simple  to  understand  and  require  a  minimum  of  unfamiliar  inputs.  The  design  which 
seemed  best  suited  in  view  of  the  above  considerations  is  a  hybrid  approach  which  uses  Monte 
Carlo  to  simulate  target  motion  and  analytic  methods  to  compute  detection  probabilities. 

A  motivation  for  use  of  Monte  Carlo  was  the  recognition  that  compulation  of  the  poste¬ 
rior  target  location  probability  distribution  can  be  viewed  as  the  numerical  evaluation  of  a  mul¬ 
tivariate  integral  of  high  dimensionality  In  such  cases  li  e.,  high  dimensionality),  classical 
numerical  integration  techniques  perform  poorly  (see.  for  example,  Shreider  [7D  especially 
when  the  integrands  can  have  jump  discontinuities  and  are  not  of  a  simple  analytic  form 
These  problems  are  typical  of  CASP  applications.  Discontinuities  occur  when  the  "target" 
moves  into  a  region  where  search  effort  is  concentrated,  and  the  joint  probability  density  for 
target  position  at  several  specified  times  during  the  search  is  a  very  complicated  function. 

The  underlying  structure  of  CASP  is  a  Markov  process,  with  a  three-dimensional  state 
space  consisting  of  points  ( X .  Y.  <t>).  The  variables  .V  and  Y  denote  latitude  and  longitude  and 
<t>  denotes  search  failure  probability.  For  /=  I.  ...  J,  the  ,/th  Monte  Carlo  replication 
(  V  .  )  ,  <t>  )  represents  the  target's  current  position  (time  is  implicit)  together  with  the  cumula¬ 
tive  probability  of  search  failure  for  that  particular  target  replication  computed  for  its  entire  his¬ 
tory  Target  motion  is  assumed  to  be  Markovian  and  successive  increments  of  search  are 
assumed  to  be  statistically  independent.  Thus  (.V,.  Y,,  d>,)  completely  describes  the  state  of  the 
/th  target  replication  at  a  given  moment. 

Figure  I  provides  a  schematic  diagram  for  the  operation  of  the  CASP  system  All  of  the 
programs  mentioned  will  be  discussed  individually  in  subsequent  subsections  The  first  step  is 
lo  construct  a  file  (called  the  "target-triple  file")  consisting  of  samples  from  the  target  location 
probability  distribution  at  the  time  of  the  distress  incident.  This  file  is  stored  on  computer  disc 
and  processed  sequentially  by  various  programs 

These  initial  points  (A  .  >  .  I)  have  failure  probabilities  9>,  -  1.  since  no  search  has  yet 
been  carried  out  The  target  positions  f.V  ,  >  I  are  sampled  from  a  probability  density  function 
f  of  the  form 
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Monte  Carlo  samples  from  a  probability  density  /-  are  obtained  by  first  using  one  of  the 
"generation  programs"  POSITION,  AREA,  or  TRACKLINE  Averages  of  densities  of  different 
types  are  obtained  by  forming  preliminary  target  triple  files  with  two  or  more  "generation"  pro¬ 
grams  and  then  combining  them  with  the  program  COMBINATION  The  construction  of  the 
prior  target  location  probability  distribution  is  shown  schematically  in  figure  1(a) 


Updates  for  target  motion  (figure  1(b))  or  to  account  for  negative  search  results  (figure 
1(c))  are  carried  out  by  reading  the  "old”  target  triple  file  from  disc  into  the  appropriate  pro¬ 
gram  and  outputting  a  "new"  target  triple  file  When  program  DRIfT  is  used  (figure  1(b)).  the 
values  of  X  and  Y,  are  modified,  but  the  value  of  <J>,  remains  unchanged  for  an  update  for 
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negative  search  results,  the  file  is  first  updated  for  motion  by  use  of  program  DRIFT.  The  tar¬ 
get  triple  file  is  frozen  at  the  mid-search  time  and  then  modified  by  RECTANGLE  or  PATH. 
These  programs  modify  <t>,  by  use  of  Bayes’  theorem  but  the  position  variables  X,  and  Y, 
remain  the  same  since  motion  is  frozen. 

The  probability  disuibutions  and  optimal  allocations  of  search  effort  are  displayed  using 
program  MAP  or  MULTI  (Figure  1(d)).  In  both  cases,  this  is  a  read-only  operation,  and  the 
target  triple  file  is  not  modified. 


Display 

The  MAP  program  displays  the  target  location  probability  distributions  in  a  two  dimen¬ 
sional  format.  Figure  2  shows  an  example  of  a  probability  map  corresponding  to  an  actual  SAR 
case.  The  geographical  region  is  divided  into  cells  oriented  north-south  and  east-west  and  the 
target  location  probabilities*  for  each  cell  are  multiplied  by  10,000  and  displayed.  Thus,  the 
number  1800  in  a  cell  indicates  that  the  target  location  probability  is  .18.  Equal  probability  con¬ 
tours  are  usually  sketched  to  make  it  easier  to  visualize  the  probability  distribution. 
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A  “quick  map"  in  which  symbols  are  used  to  represent  ranges  of  probabilities  can  also  be 
output.  The  quick  map  provides  a  compact  version  of  the  probability  distribution  which  is  suit¬ 
able  for  a  quick  appr.i  sal  of  the  search  situation  and  is  convenient  for  inclusion  in  afier-action 
reports 

Finally.  MAP  can  output  an  ordered  list  of  the  highest  probability  cells  and  the  amount  of 
efTort  to  be  placed  in  each  cell  in  order  to  maximize  detection  probability.  More  will  be  said 
about  search  optimization  in  the  last  subsection. 
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Initial  Target  Location  Probability  Distribution 

The  initial  target  location  probability  distribution  is  construcied  from  "building  block  dis¬ 
tributions"  using  a  weighted  scenario  approach.  The  individual  building  block  distributions  are 
generated  by  the  use  of  one  or  more  of  the  programs  POSITION,  TRACKLINE,  and  AREA 
Program  COMBINE  is  used  to  combine  the  outputs  of  the  individual  "generation"  programs 

in  most  SAR  cases,  there  is  scant  information  available  about  the  target's  position  at  the 
time  of  distress.  Sometimes,  for  example,  a  fisherman  simply  is  reported  overdue  ai  the  end  of 
a  day.  He  may  have  been  planning  to  fish  in  one  of  several  fishing  grounds  but  did  not  make 
his  precise  intentions  known. 

In  other  cases,  more  information  is  available.  Eor  example,  it  might  be  known  that  a 
vacationer  was  intending  to  sail  from  one  marina  to  another  but  never  arrived  at  the  intended 
destination.  In  some  cases,  it  might  also  be  known  that  there  was  bad  weather  along  the 
intended  route.  This  would  make  some  positions  along  track  more  likely  for  a  distress  than 
others. 

In  order  to  encourage  inclusion  of  diverse  possibilities  in  these  scenarios,  it  is  a  recom¬ 
mended  practice  for  two  or  three  search  planners  to  work  out  the  details  together  The 
remainder  of  this  subsection  will  describe  the  programs  POSITION,  AREA,  and  TRACKLINi 
which  are  used  to  simulate  the  scenarios  and  generate  the  initial  target  location  probability  dis¬ 
tribution. 

Position.  A  POSITION  scenario  has  two  parts,  an  initial  position  and  a  subsequent  dis¬ 
placement.  POSITION  can  be  used  to  generate  a  weighted  average  of  as  many  as  ten  scenarios 

The  initial  position  probability  distribution  is  modeled  as  a  bivariate  normal  distribution, 
and  the  displacement  is  modeled  as  a  distribution  over  an  arnular  sector  In  the  latter  distribu¬ 
tion.  the  angle  and  distance  random  variables  are  assumed  to  be  independent  and  uniformly 
distributed  between  minimum  and  maximum  values  input  by  the  user  The  displacement  distri¬ 
bution  is  useful,  for  example,  in  cases  where  the  initial  position  corresponds  to  the  last  fix  on 
the  target  and  where  one  can  estimate  the  course  and  speed  of  subsequent  movement  prior  to 
the  occurrence  of  the  distress  incident. 

The  displacement  option  can  also  be  used  in  cases  involving  a  "bail  out"  where  it  can 
describe  the  parachute  drift.  The  amount  of  displacement  in  this  case  will  depend  upon  the 
altitude  of  the  aircraft  and  the  prevailing  winds  at  the  time  Since  these  factors  are  rarely 
known  precisely,  the  capability  to  "randomize"  direction  and  distance  is  an  important  feature 

Area  The  second  generation  program  is  AREA  This  program  is  used  to  generate  an  ini¬ 
tial  target  location  probability  distribution  in  cases  where  a  general  region  can  be  postulated  for 
the  location  of  the  distress  incident  but  where  a  normal  distribution  simulated  by  POSH  ION 
would  be  a  poor  representation  of  the  uncertainty  Each  scenario  for  program  AREA  deter¬ 
mines  a  uniform  probability  density  within  a  convex  polygon  ARI  A  might  he  used,  for  exam¬ 
ple,  when  a  lost  fisherman's  usual  fishing  ground  is  known  from  discussions  with  friends  and 
relatives.  As  with  POSITION,  AREA  can  generate  a  weighted  average  of  Ml  scenarios 
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Track  line.  The  third  and  Iasi  generation  program  is  TRACKLINE.  This  program  is  the 
most  complex  of  the  generation  programs  and  is  used  when  target  track  information  is  available 
from  a  float  plan  or  some  other  source.  TRACKLINE  creates  a  probability  distribution  about  a 
base  track  This  irack  can  be  constructed  from  as  many  as  10  segments,  each  of  which  can  be  a 
portion  of  a  rhumb  line  or  of  a  great  circle. 

The  motion  of  the  target  about  each  base  track  segment  is  specified  by  three  circular  nor¬ 
mal  probability  distributions  corresponding  to  target  position  at  the  initial,  mid-point,  and  end¬ 
point  of  each  segment.  Each  simulated  target  track  is  obtained  by  drawing  random  numbers  for 
target  position  from  these  distributions  and  then  connecting  the  points  with  straight  lines. 

Figure  3  illustrates  a  typical  situation.  The  target’s  point  of  departure  and  intended  desti¬ 
nation  are  assumed  known,  and  a  base  track  is  constructed  between  these  points.  The  base 
track  might  be  taken  from  the  target's  float  plan  or  hypothesized  from  the  target’s  past  habits. 
In  the  case  illustrated  by  figure  3,  there  are  three  track  segments.  The  50%  circles  of  uncer¬ 
tainty  are  assumed  to  grow  in  size  to  about  midway  along  the  track  and  then  diminish.  Since 
the  point  of  departure  and  intended  destination  are  assumed  to  be  known,  the  extreme  end¬ 
points  of  the  entire  track  have  zero  uncertainty. 
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Trackline.  The  third  and  last  generation  program  is  TRACKLINE.  This  program  is  the 
most  complex  of  the  generation  programs  and  is  used  when  target  track  information  is  available 
from  a  float  plan  or  some  other  source.  TRACKLINE  creates  a  probability  distribution  about  a 
base  track.  This  track  can  be  constructed  from  as  many  as  10  segments,  each  of  which  can  be  a 
portion  of  a  rhumb  line  or  of  a  great  circle. 

The  motion  of  the  target  about  each  base  track  segment  is  specified  by  three  circular  nor¬ 
mal  probability  distributions  corresponding  to  target  position  at  the  initial,  mid-point,  and  end¬ 
point  of  each  segment.  Each  simulated  target  track  is  obtained  by  drawing  random  numbers  for 
target  position  from  these  distributions  and  then  connecting  the  points  with  straight  lines. 

Figure  3  illustrates  a  typical  situation.  The  target’s  point  of  departure  and  intended  desti¬ 
nation  are  assumed  known,  and  a  base  track  is  constructed  between  these  points.  The  base 
track  might  be  taken  from  the  target's  float  plan  or  hypothesized  from  the  target’s  past  habits. 
In  the  case  illustrated  by  Figure  3,  there  are  three  track  segments.  The  50%  circles  of  uncer¬ 
tainty  are  assumed  to  grow  in  size  to  about  midway  along  the  track  and  then  diminish.  Since 
the  point  of  departure  and  intended  destination  are  assumed  to  be  known,  the  extreme  end¬ 
points  of  the  entire  track  have  zero  uncertainty. 
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In  some  cases,  there  is  information  which  leads  one  to  suspect  that  the  distress  is  more 
likely  to  have  occurred  on  one  part  of  the  track  than  on  another  For  example,  as  mentioned 
above,  the  track  may  have  passed  through  an  area  of  storms  and  heavy  seas.  If  desired,  the  tar¬ 
get  location  probability  distribution  generated  by  TRACKLINE  can  be  made  to  have  a  higher 
density  in  such  an  area.  This  is  done  by  specifying  the  highest  probability  point  along  base 
track  together  with  the  odds  that  the  d  stress  occurred  there  rather  than  at  the  extreme  end¬ 
points  of  the  track.  These  inputs  detei.riine  a  truncated  triangular  probability  density  for  the 
fraction  of  track  covered  before  the  distress  incident  occurred. 

Updating  for  Target  Motion 

The  DRIFT  program  is  used  to  alter  a  target  location  probability  distribution  to  account 
for  the  effects  of  drift.  Normaliv,  the  DRIFT  program  will  cause  the  center  of  the  distribution 
to  move  to  a  new  location  and  the  distribution  to  become  more  diffuse. 

Target  motion  due  to  drift  complicates  the  maritime  search  problem.  The  prediction  of 
drift  must  account  for  the  effects  of  both  sea  current  due  to  prevailing  circulation  and  predicted 
or  observed  surface  wind.  Any  object  floating  free  on  the  ocean  surface  is  transported  directly 
by  surface  current,  and  one  component  vector  of  drift  is  therefore  equal  to  the  predicted 
current  vector.  A  statistical  file  collected  from  ship  reports  over  many  years  has  been  assem¬ 
bled  by  the  Coast  Guard  and  arranged  by  geographical  location  and  month  of  the  year.  The  file 
in  use  in  the  CASP  system  covers  most  of  the  North  Atlantic  and  North  Pacific  Oceans. 

As  mentioned  above,  wind  is  also  important  in  predicting  target  motion.  With  regard  to 
this  factor,  there  are  two  major  considerations.  The  first  is  the  drift  caused  by  the  wind  imping¬ 
ing  on  the  drifting  object's  surface  area  above  water;  this  is  called  "leeway."  The  speed  and 
direction  of  leeway  is  different  for  different  objects,  and  is  usually  difficult  to  predict. 

The  second  wind  consideration  is  the  movement  of  the  surface  layer  of  the  ocean  itself, 
this  is  called  "local  wind  current."  It  is  one  of  the  most  complex  and  least  understood 
phenomena  in  the  entire  drift  process. 

The  primary  data  source  for  surface  winds  in  the  CASP  system  is  the  Navy's  Fleet 
Numerical  Weather  Central  in  Monterey,  California.  Every  twelve  hours  their  computers  gen¬ 
erate  a  time  series  for  hemispheric  wind  circulation,  three  of  these  time  series  are  used  to  pro¬ 
duce  certain  geographical  blocks  of  wind  data  which  are  transmitted  to  the  Coast  Guard  for  use 
by  CASP.  All  data  are  retained  in  the  system  for  two  to  three  months. 

The  process  of  applying  the  drift  motion  to  update  a  CASP  distribution  is  simple  enough 
First,  a  set  of  total  drift  vector  probability  distributions  is  computed  for  various  geographical 
areas  based  upon  estimates  of  sea  current,  leeway,  and  local  wind  current.  Then  for  each  target 
location  replication,  a  random  vector  of  net  drift  is  drawn  from  the  appropriate  probability  dis¬ 
tribution  and  used  to  move  the  target  forward  a  short  time.  The  procedure  is  repealed  until  the 
entire  update  time  is  taken  into  account. 

Updating  for  Negative  Search  Results 

Once  a  search  has  actually  been  conducted,  one  of  the  two  search  update  programs.  REC¬ 
TANGLE  and  PATH  (depending  upon  the  type  of  search),  is  run  to  revise  the  target  location 
probabilities  to  account  for  unsuccessful  search  The  effect  is  to  reduce  the  probabilities  within 
the  area  searched,  and  to  increase  them  outside 
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l  pdaiing  i lie  target  location  probabilities  lor  negative  search  results  is  carried  out  by  an 
application  ot  Bases'  theorem.  Recall  that  the  target  triple  file  contains  J  records  of  the  form 
(  \  .  >  .  <J>  l  for  U  j  <  I  where  the  pair  ( A", .  Y,)  represents  target  position,  and  <t> , 
represents  the  probability  that  the  target  replication  would  not  have  been  detected  by  the  cumu¬ 
lative  search  ellort  under  consideration.  The  overall  cumulative  probability  of  detection  taking 
all  simulated  targets  into  account  is  called  search  effectiveness  probability  (SEP)  and  is  com¬ 
puted  by  the  formula 

i 

si  p  =  i  £  a>  /y. 


Let  (  be  a  region  in  the  search  area,  and  let  denote  the  event,  "target  corresponds  to 
the  /th  replication  and  is  in  region  ('."  The  posterior  probability  A(C)  that  the  target  is  located 
in  (  given  search  failure  is  computed  using  Bayes’  theorem  by 

./ 

\U  I  =  /V (Target  in  ('|  -Search  failure }  =  Y  Pr[B,\  Search  failure) 

i- 1 

,/ 

=  Y  (Search  failure  I  B]  Pr  { B ,}/ Pr  {Search  failure) 

i 

-  Y  ■»'  /  Y  <lv 

■I  .  ! 

where  I  -  |  /  :  t  \  .  Y  >  (  ( ')  denotes  the  set  of  indicies  corresponding  to  target  replications  in  ( 

Now  suppose  that  //,  denotes  the  probability  of  failing  to  detect  the  ,/th  target  replication 
during  a  particular  update  period.  Using  the  independence  assumption,  the  new  individual 
cumulative  failure  probability  <l>  is  computed  by 

T  = 

where  *l>'  denotes  the  cumulative  failure  probability  prior  to  the  last  increment  of  search. 

I  lie  compulation  of  the  conditional  failure  probability  q,  is  carried  out  in  C’ASP  by  use  of 
a  (  M.  /(.  it  (-detection  model  its  described  below.  Recall  (e  g.,  see  Koopman  [21)  that  the 
"lateral  range"  between  searcher  and  target  (both  with  constant  course  and  speed)  is  defined  as 
the  distance  a.  closest  point  of  approach.  The  "lateral  range  function"  gives  single  sweep  cumu¬ 
lative  detection  probability  for  a  specified  lateral  range  for  a  specified  period  of  time.  The 
integral  ot  the  lateral  range  function  is  called  the  "sweep  width"  of  the  sensor. 

Lite  (  ASP  programs*  arc  based  upon  the  assumption  that  the  lateral  range  function  for 
the  search  unit  is  rectangular  and  is  described  by  two  parameters.  M  and  fi.  Mere  M  denotes 
the  total  width  of  the  swept  path,  and  fi  denotes  the  probability  that  the  target  would  be 
delected  lor  lateral  ranges  less  than  or  equal  to  M/2.  The  sweep  width  W*' for  the  rectangular 
lateral  range  function  described  above  is  given  by 

It  -  fi  M 

Navigational  uncertainties  ("pattern  error")  are  introduced  into  the  detection  model  by 
assuming  each  sweep  is  a  random  parallel  displacement  from  the  intended  sweep.  The  random 
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displacements  are  assumed  to  be  independent  identically  distributed  normal  random  variables 
with  zero  mean  and  standard  deviation.  This  model  was  introduced  by  R.  K.  Reber  (e  g.,  see 
Reber  (4))  and  used  extensively  in  certain  Navy  search  analyses. 


Rectangular  lateral  range  functions  are  a  useful  way  of  approximating  more  complex 
lateral  range  functions.  If  the  actual  lateral  range  function  has  sweep  width  M  and  is  nonzero 
over  an  interval  of  width  M ,  then  one  may  define  fi  to  be  the  average  detection  probability  over 
the  effective  range  of  the  sensor,  i.e.,  /3  =  W/ M.  Appendix  A  of  [4]  shows  that  replacement  of 
the  actual  lateral  range  function  by  a  rectangular  lateral  range  function  with  average  probability 
/3  usually  does  not  lead  to  significant  errors  in  the  computed  value  of  probability  of  detection 
for  parallel  path  search.  Cases  where  there  is  significant  disagreement  occur  when  the  lateral 
range  function  is  close  to  zero  over  a  large  part  of  its  support. 


Let  G„  denote  the  cumulative  normal  probability  distribution  function.  Let  (u.  v)  denote 
the  target’s  position  in  a  coordinate  system  where  the  origin  is  at  the  midpoint  of  a  given 
sweep,  and  where  the  u-axis  is  parallel  to  the  sweep  and  the  v-axis  is  perpendicular  to  the 
sweep.  Then  for  fixed  M,  1$,  and  <t,  the  single  sweep  probability  p(u,  v)  of  detecting  the  target 
is  given  by 


(1) 


L 

a 

G„ 

U  +  1 

~  G„ 

L 
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v  + 


M 
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~  G„  v 


M 
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where  L  denotes  the  length  of  the  sweep. 


If  there  are  K  search  legs  to  be  considered,  and  if  <//,\  v(M  denotes  the  coordinates  of  the 
_/th  simulated  target  position  relative  to  the  Ath  search  leg,  then  the  failure  probability  q,  is 
given  by 

<2)  </,  =  n  i 1  _  v/A' )i- 

*-i 

The  application  of  these  formulas  in  programs  PATH  and  RKCTANGLK  can  now  be  discussed. 


Path.  Program  PATH  is  used  to  represent  general  search  patterns  constructed  from 
straight  track  segments.  For  example,  PATH  can  be  used  to  compute  detection  probabilities  for 
a  circle  diameter  search  where  the  search  tracks  are  intended  to  cover  a  given  circle  by  making 
repeated  passes  through  its  center.  PATH  makes  direct  use  of  (I)  and  (2). 


Rectangle.  Program  RECTANGLE  has  been  designed  for  the  special  case  where  a  rectan¬ 
gle  is  searched  using  parallel  sweeps.  RECTANGLE  reduces  the  computing  time  and  amount 
of  input  that  otherwise  would  be  required  using  program  PATH.  For  a  point  outside  the  desig¬ 
nated  rectangle,  the  probability  of  detection  q ,  is  assumed  to  be  0  For  a  point  inside  the  desig¬ 
nated  rectangle,  "edge"  effects  are  ignored  and  an  average  probability  of  detection  is  computed 
as  if  there  were  an  infinite  number  of  sweeps,  each  infinitely  long. 

The  following  line  of  reasoning  originated  with  R  K  Reber  Reber  |4|  presents  results  in 
the  form  of  curves  and  tables,  and  these  have  been  adapted  to  program  RECTANGLE  by  use 
of  polynomial  approximations.  Let  S  denote  the  spacing  between  sweeps  Since  the  sweeps  are 
assumed  to  be  parallel  and  of  infinite  extent,  the  coordinate  v‘  expresses  the  lateral  range  for 
the  Ath  sweep  and  the  ,/lh  simulated  target  location  and  is  given  by 

v  *  =  n  ,  +  A  .S' 

for  -oo  <  A  <  °°  and  a  number  m,  such  that  Im  ,I  $  -V 
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Now  for  arbitrary  n,  refer  to  (1)  and  (2)  and  define  g  by 

(3)  g(n.  s)  =  n  ii  -  *»(«./*  +  *s)i  =  n  p 

K  -  •»  k  -  oo 

Note  that  since  the  sweeps  are  assumed  to  be  of  infinite  length,  one  has  u  =  <»  and  £  defined 
by  (3)  does  not  depend  upon  u.  The  function  g  is  periodic  in  its  first  argument  with  period  /*. 
Let  g(S)  denote  the  average  value  of  g(p,  s)  with  respect  to  the  first  argument.  Then 

*(S)  “  Ij  X  S) 

The  function  g  has  been  tabulated  in  [4]  and  is  used  in  program  RECTANGLH  to 
represent  the  failure  probability  q,  =  gUS)  for  a  point  lying  within  the  designated  search  rectan¬ 
gle.  RECTANGLE  and  PATH  agree  (as  they  should)  when  PATH  is  used  to  represent  a  paral¬ 
lel  path  search. 

Search  Optimization 

Two  programs,  MAP  and  MULTI,  are  used  for  optimizing  the  allocation  of  search  effort. 
MAP  provides  a  quick  way  of  determining  the  search  cells  which  should  receive  effort  based 
upon  a  constraint  on  total  track  line  miles  available.  MULTI  determines  search  areas  for  multi¬ 
ple  search  units  under  the  constraint  that  each  unit  must  be  assigned  a  uniform  coverage  of  a 
rectangle  and  that  the  rectangles  for  the  various  search  units  do  not  overlap. 

The  method  used  in  program  MAP  is  based  upon  use  of  an  exponential  detection  function 
(see  Stone  |X|>  introduced  by  Koopman  13)  and  does  not  impose  constraints  on  the  type  of 
search  pattern  employed  The  primary  usefulness  of  this  program  is  to  provide  the  search 
planner  with  a  quick  method  for  defining  the  area  of  search  concentration.  The  following  para¬ 
graphs  give  a  brief  sketch  of  the  methods  used  in  these  optimization  programs. 

Map  Let  there  be  .V  search  cells,  and  for  \  %  n  ^  N  let  p„  and  «„  denote,  respectively, 
the  target  location  probability  and  the  area  associated  with  the  nth  cell  The  probability  density 
for  target  location  in  the  nth  cell  is  given  by  </,,  =  Suppose  that  total  search  effort  is 

measured  by  the  product  of  track  Ime  miles  and  sweep  width. 

Let  y  denote  an  allocation  of  search  effort  where  y  in)  denotes  the  amount  of  search 
effort  (measured  in  area  swept)  allocated  to  the  nth  cell.  Probability  of  detection  Pn |y)  is  com¬ 
puted  using  an  exponential  effectiveness  function,  i.e., 

\  ' 

/*/> (y  I  =  £  /'.,(l  cxp(— y  (n  )/<*„)). 

•i  ■  i 

The  objective  is  to  maximize  l’n  subject  to  a  constraint  on  total  effort  available  This  is  easily 
done  using  the  techniques  introduced  by  Koopman  [31.  easier  proofs  are  provided  in  Stone  IXl 
and  Wagner  1 1  2] 

It  can  be  shown  that  under  the  above  assumptions,  the  initial  increments  of  effort  should 
be  concentrated  in  the  highest  probability  density  cells,  and  that  there  should  be  a  succession  of 
expansions  to  cells  having  lower  target  location  probability  density 

In  order  to  derive  the  formulas  used  in  program  MAP,  a  new  collection  of  equi-density 
search  regions  is  formed  made  up  of  the  unions  of  all  cells  having  equal  probability  density 
Let 
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K  =  the  number  of  equi-density  regions 
</t  =  the  probability  density  for  region  A 

tK  =  the  set  of  indices  corresponding  to  I  he  cells  comprising  region  A 
=  the  area  of  region  A. 

Using  the  above  notation 

■u  “  L  “  • 

Let  £,  denote  the  total  effort  which  must  be  expended  before  the  optimal  search  expands  into 
the  Ath  region.  Assume  that  the  equi-densily  regions  have  been  ordered  beginning  with  the 
region  having  the  highest  density.  Since  search  begins  in  the  highest  density  region,  we  have 
/■•  !  =  0  It  can  be  shown  that  in  general  for  A  ^  2 

A  i 

(4)  -  I-:k  ,  +  (In*/*  ,  -  In  </A>  £  A.„. 

..i-i 


figure  4  shows  output  from  program  MAP  illustrating  the  use  of  (4).  The  list  shows  the 
2s  highest  probability  cells  specified  by  the  latitude  and  longitude  of  the  southeast  corner.  Each 
cell  is  15  minutes  wide,  and  the  numbers  in  the  last  column  correspond  to  the  values  /•.<  given 
b>  (41  The  planning  advice  given  in  |I0)  is  to  apply  search  effort  to  any  cell  for  which  the 
value  m  the  effort  column  is  less  than  the  total  effort  available. 
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Notice  that  the  numbers  in  the  effort  column  are  not  necessarily  increasing.  This  is 
because  the  list  is  ordered  according  to  containment  probability  rather  than  probability  density 

Muhi.  As  mentioned  above,  program  MAP  does  not  take  into  account  "simplicity"  con¬ 
straints  which  are  considered  important  in  operational  planning.  Program  MULTI  was  designed 
to  overcome  this  drawback  in  cases  where  multiple  search  units  are  deployed  in  the  same  search 
area. 

The  first  simplicity  constraint  introduced  is  that  each  unit  will  be  assigned  to  uniformly 
search  a  rectangle.  Figure  5  shows  the  dimensions  of  the  optimal  rectangle  and  the  resulting 
probability  of  detection  under  the  assumption  that  the  target  location  probability  distribution  is 
normal.  In  order  to  use  this  figure,  one  first  computes  the  normalized  effort  A*  by  the  formula 


nt.i  \  ir  min 


where  R  is  the  sweep  rate  of  the  unit.  T  is  the  total  search  time,  and  <rnu,  and  <rmm  are  the 
standard  deviations  of  the  normal  distribution  when  referred  to  principal  axes.  The  optimal 
search  rectangle  will  have  half  side  given  by  and  where  the  size  factor  i  *  is 

given  by  the  designated  curve  with  values  read  along  the  outer  vertical  scale. 


Figure  5  provides  curves  to  determine  the  probability  of  detection  for  the  optimal  rectan¬ 
gle  plan  and  for  the  unconstrained  optimal  plan  It  is  interesting  to  note  that  in  all  cases  the 
probability  of  detection  provided  by  the  optimal  rectangle  plan  is  at  least  95%  of  that  provided 
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by  the  unconstrained  optimal  plan.  Thus,  under  the  assumption  stated,  uniform  search  of  the 
optimal  rectangle  can  be  recommended  without  hesitation  since,  in  most  cases,  the  simplicity  of 
the  rectangle  plan  is  more  important  than  the  small  improvement  in  effectiveness  obtained  by 
the  more  complicated  optimal  plan. 

MULTI  is  capable  of  allocating  the  effort  of  up  to  5  search  units  to  nonoverlapping  rectan¬ 
gles  in  a  way  which  is  intended  to  maximize  overall  probability  of  detection.  The  first  step  in 
this  procedure  is  to  approximate  the  target  location  probability  distribution  by  the  weighted 
average  of  k  bivariate  normal  distribution  where  /  <  £  <  3.  This  is  done  by  locating  the  three 
highest  local  maxima  in  the  smoothed  cell  distribution  and  then  associating  each  simulated  tar¬ 
get  position  with  the  nearest  cluster  point.  If  three  local  maxima  cannot  be  found,  then  the 
procedure  is  carried  out  with  one  or  two  local  maxima.  The  mean  and  covariance  matrix  of 
each  cluster  are  calculated  to  determine  the  parameters  of  the  approximating  normal  distribu¬ 
tion. 


The  program  next  considers  all  possible  assignments  of  search  units  to  one  of  the  three 
approximating  probability  distributions.  Since  there  are  a  maximum  of  five  units  and  three  dis¬ 
tributions,  there  are  at  most  3  s  =  243  different  ways  of  assigning  units  to  distributions.  For 
each  assignment,  the  program  sums  up  the  total  effort  available  to  search  each  distribution  and 
then  computes  the  resulting  optimal  rectangle  and  associated  probability  of  detection.  If  Pk 
denotes  the  conditional  probability  of  detecting  the  target  with  optimal  rectangle  search  given 
that  the  target  has  the  fcth  distribution  (1  ^  k  ^  k),  then  probability  of  detection  A  for  the 
allocation  is  given  by 

A  -  £  PkDk. 

*-i 


The  program  prints  the  allocation  which  gives  the  maximum  probability  of  detection  and 
notes  whether  any  of  the  rectangles  overlap.  If  overlap  occurs,  then  the  next  ranking  allocation 
is  printed,  and  so  on.  This  continues  until  an  allocation  without  overlap  is  found  or  until  the 
top  five  allocations  have  been  listed  together  with  their  associated  detection  probabilities. 
Finally,  when  several  units  are  assigned  to  the  same  rectangle,  it  is  subdivided  in  a  way  which 
preserves  the  uniform  coverage. 

Recently  an  alternative  method  for  multiple  unit  allocation  has  been  developed  (see  Dis- 
cenza  [1])  based  upon  integer  programming  considerations. 

3.  CASP  CASE  EXAMPLE 

On  12  September  1976  the  sailing  vessel  S/V  Spirit  departed  Honolulu  enroute  San  Fran¬ 
cisco  Bay.  The  owner,  who  was  awaiting  its  arrival  in  San  Francisco,  reported  concern  for  the 
vessel  to  the  Coast  Guard  on  14  October  1976  after  it  had  failed  to  arrive.  An  Urgent  Marine 
Information  Broadcast  (UMIB)  was  initiated  on  17  October.  The  following  day,  a  merchant 
vessel  the  M/V  Oriental  Financier  reported  recovering  a  life  raft  with  two  survivors  from  the 
S/V  Spirit  which  had  sunk  in  heavy  seas  in  mid-Pacific  on  the  morning  of  27  September.  Sur¬ 
vivors  indicated  three  more  crewmembers  in  a  separate  raft  were  still  adrift.  This  information 
opened  an  extensive  six  day  air  and  surface  search  for  the  missing  raft  that  eventually  located 
the  raft  with  one  of  the  missing  persons  on  board 
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Each  day’s  search  was  planned  utilizing  computer  SAR  programs.  Initial  distress  position 
information  was  gained  by  radio-telephone  debriefing  of  the  survivors  aboard  the  M/B  Oriental 
Financier  on  several  occasions.  The  search  began  19  October  based  on  a  SARP*  datum  for  a 
raft  without  a  drogue  from  an  initial  reported  position  of  36N  136W.  The  second  day’s  search 
was  based  on  a  SARP  datum  for  a  position  160  nautical  miles  lo  the  northeast  from  the  previ¬ 
ous  position  (this  position  being  determined  from  further  debriefing  of  the  survivors  over 
radio-telephone).  The  third  through  the  six  days'  searches  were  planned  utilizing  CASP  output 
from  a  POSITION  scenario  consisting  of  an  ellipse  with  a  160  mile  major  axis  and  a  60  mile 
minor  axis.  The  CASP  program  was  updated  by  RECTANGLE  and  DRIFT  daily,  and  search 
areas  assigned  to  cover  the  highest  cells  which  could  be  reached  taking  into  account  search  unit 
speed  and  endurance. 

The  following  chronology  is  based  upon  the  official  USCG  report  and  describes  the  utiliza¬ 
tion  of  CASP  in  the  search  planning.  This  case  is  a  good  illustration  of  the  many  uncertainties 
which  must  be  analyzed  during  a  search  and  the  way  bdth  negative  and  positive  information 
contribute  to  eventual  success. 

21  October  1976 

Search  planning  for  the  day’s  operations  utilized  the  CASP  program  for  the  first  time. 
New  probable  distress  position  information  given  by  the  survivors  was  evaluated  and  the  CASP 
program  was  initiated  using  a  POSITION  scenario  with  center  length  160  miles  and  width  60 
miles  oriented  on  046°T,  with  the  southwest  end  at  position  36N  136W.  This  scenario  was  to 
be  used  for  the  rest  of  the  search.  A  search  plan  was  generated  for  the  21  October  search  cov¬ 
ering  approximately  8  of  the  10  highest  CASP  cells  as  given  in  MAP.  Ten  units  were  desig¬ 
nated  for  the  day’s  efforts  and  consisted  of  3  Coast  Guard,  2  Navy,  and  4  Air  Force  aircraft  and 
the  USS  Cook. 

The  first  aircraft  which  arrived  on  scene  for  the  day’s  search  reported  the  weather  in  the 
search  area  as  ceiling  varying  200-1500  feet  (scattered),  wind  from  330°  at  8  knots,  seas  4  feet, 
and  visibility  unlimited  except  in  occasional  rain  showers 

At  3:06  PM  an  aircraft  located  what  appeared  to  be  the  life  raft  of  recovered  survivors  in 
position  35-38N  138-12W.  M/V  Oriental  Financier  had  been  unable  to  recover  this  raft  when 
the  survivors  were  rescued.  The  USS  Cook  investigated  and  reported  negative  results. 

Figure  6  shows  the  search  plan  for  21  October.  Note  that  the  target  was  eventually  found 
on  24  October  in  the  first  designated  area  C-l.  There  is,  of  course,  no  way  of  knowing  where 
the  target  was  on  the  2 1st. 

22  October  1976 

Planning  for  day’s  search  was  done  using  updates  from  the  CASP  program  Search  units, 
consisting  of  17  aircraft  (3  Coast  Guard,  6  Navy,  and  8  Air  Force)  and  the  USS  Cook,  were 
designated  areas  totaling  67,920  square  miles  for  the  day’s  effort  Areas  assigned  were  deter¬ 
mined  from  the  MAP’s  twelve  highest  cells  High  altitude  photographic  reconnaissance  flight 
utilizing  U-2  aircraft  was  also  scheduled,  cloud  coverage  permitting,  to  cover  an  area  of  57,600 
square  miles 


A  computer  program  implementing  methods  described  in  ihe  National  SAR  Manual  and  a  precursor  to  (  ASP 
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The  first  aircraft  on  scene  for  the  day’s  search  reported  the  weather  in  the  general  area  as 
ceiling  1800  feet  (broken),  winds  from  150°  at  6  knots,  seas  2  feet,  and  visibility  15  miles. 

Search  conducted  during  daylight  hours  utilized  15  aircraft,  the  USS  Cook,  and  a  U2  high 
altitude  reconnaissance  flight.  The  USS  Cook  was  unable  to  relocate  debris  sighted  during  pre¬ 
vious  day’s  search.  Two  Air  Force  aircraft  failed  to  arrive  on  scene  prior  lo  darkness  and  were 
released.  Aircraft  on  scene  searched  88  percent  of  67,920  square  miles  assigned  and  obtained 
POD’s  ranging  from  50  to  82  percent.  The  high  altitude  photographic  reconnaissance  flight  was 
conducted  from  an  altitude  of  approximately  50,000  feet. 

The  CGC  Campbell  arrived  on  scene  and  relieved  the  USS  Cook. 

23  October  1976 

The  Rescue  Coordination  Center  (RCC)  was  advised  by  the  Air  Force  that  development 
of  high  altitude  film  had  shown  an  "orange  dot"  in  position  35-I6N  139-05W  The  photo¬ 
graphed  object  was  described  as  a  round  orange  object,  approximately  7  feet  in  diameter,  float¬ 
ing  on  the  surface  of  the  water. 

Search  planning  was  done  using  updates  from  the  CASP  program.  Search  units,  consist¬ 
ing  of  the  CGC  Campbell  and  8  aircraft  (2  Coast  Guard,  3  Navy,  and  3  Air  Force),  were 
assigned  areas  of  highest  CASP  cells.  The  object  photographed  by  reconnaissance  aircraft  was 
drifted  by  SARP  and  the  CGC  Campbell  and  1  aircraft  dedicated  to  locate  it. 

The  first  aircraft  on  scene  for  the  day’s  search  reported  weather  in  the  search  area  as  ceil¬ 
ing  2000  feet,  wind  from  200°  at  12  knots,  seas  2  feel,  and  visibility  15  miles. 

Search  conducted  during  daylight  hours  utilized  8  aircraft  and  CGC  Campbell.  Search 
units  covered  97  percent  of  the  assigned  34,300  square  miles  with  POD’s  ranging  from  50  to  92 
percent.  Several  sightings  of  assorted  flotsam  were  reported  but  none  linked  to  Spirit  or  rafts. 
The  object  photographed  by  the  high  altitude  reconnaissance  flight  on  22  October  was  not  relo¬ 
cated  by  search  units. 

Figure  7  shows  the  search  plan  for  23  October.  Although  not  indicated  in  the  chart,  the 
position  where  the  target  was  found  on  the  24th  is  in  the  second  highest  probability  density  cell 
from  the  CASP  map 

24  October  1976 

Search  planning  for  the  day's  operations  was  done  using  updates  from  the  CASP  program 
Search  units  consisting  of  the  CGC  Campbell  and  5  aircraft  (2  Coast  Guard  and  3  Navy)  were 
assigned  areas  of  highest  CASP  probability  totaling  18,082  square  miles,  with  CGC  Campbell 
and  one  Coast  Guard  aircraft  designated  for  location  of  the  object  reported  by  the  reconnais¬ 
sance  flight. 

The  position  of  the  reconnaissance  flight  sighting  of  22  October  was  drifted  utilizing 
SARP  and  the  new  position  passed  to  CGC  Campbell  for  search  purposes.  The  11:00  AM 
SARP  datum  was  computed  to  be  35-29. 4N  1 38-39. 2W  with  standard  first  search  radius  of  16  9 
miles  The  search  plan  is  shown  in  Figure  8 
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The  first  aircraft  on  scene  for  the  day’s  search  reported  weather  in  the  search  area  as  ceil¬ 
ing  1S00  feet,  wind  from  000°  at  7  knots,  seas  3  feet,  and  visibility  10  miles. 

The  CGC  Campbell  reported  locating  a  rusty,  barnacle  encrusted  55  gallon  drum  in  posi¬ 
tion  35-27. 2N  138-39. 0W 

At  12:05  PM  the  search  met  with  success!  A  Coast  Guard  HC-130H  reported  sighting  a 
raft  in  position  36-03N  138-00W  with  at  least  one  person  on  board  The  CGC  Campbell  pro- 
ceded  enroute  to  investigate,  and  at  2:37  PM  CGC  Campbell  reported  on  scene  with  the  raft  in 
position  35-53N  138-10W.  A  small  boat  was  lowered  to  recover  the  survivor,  and  at  3:01  PM 
all  search  units  were  released  from  the  scene 

4.  TRAINING 

CASP  training  began  with  an  operational  testing  phase  in  cooperation  with  the  New  York 
RCC  This  operational  testing  was  useful  in  orienting  the  personnel  to  the  benefits  derived 
trom  more  detailed  search  planning,  and  provided  an  idea  of  what  the  full  training  problem  was 
going  to  be  like. 

Coincident  with  this,  a  training  manual  19]  and  a  cor,  ,>letel>  new  combined  operating 
handbook  [10)  were  developed  encompassing  all  of  the  operational  computer  services  available 

At  the  time  of  official  implementation  in  February  1974,  a  special  four-day  class  was  con¬ 
ducted  m  the  operation  of  the  CASP  system,  this  class  was  attended  by  one  representative  from 
each  Rescue  Coordination  Center.  It  was  intended  that  these  persons  would  learn  the  system 
thoroughly  and  return  to  their  respective  commands  and  teach  others  This  plan  was  marginalK 
successful,  and  worked  only  in  those  cases  where  an  extremely  capable  individual  was  selected 
for  attendance. 

During  the  next  six  months,  personnel  from  the  Operations  Analysis  Branch  visited  each 
fas:  f 'oj.si  RCC  for  one  week  apiece  in  oruet  to  provide  additional  training.  Subsequently,  the 
same  visit  schedule  was  repeated  on  the  West  Coast. 

Anotber  valuable  tot  !  tor  'raining  has  been  telephone  consultation  Fortunately,  all  mes¬ 
sages  irto  and  out  of  the  computer  are  monitored  at  New  York,  and  personnel  can  be  helped 
with  the  details  of  inpu:  and  output  with  a  quick  telephone  call  on  the  spot. 

Fmaliy.  the  National  Search  and  Rescue  School  has  made  CASP  training  a  regular  part  of 
its  curriculum  I  he  schoo.  loc  ut-o  on  Governor-  Island,  is  responsible  for  initial  training  of  all 
RCC  personnel  (among  many  others)  in  ihe  techniques  of  search  and  rescue  The  present  SAR 
school  naming  session  is  four  weeks  m  duration  with  the  fourth  week  devoted  to  computer 
search  planning  systems  training  Over  halt  of  tins  time  is  devoted  directly  to  CASP 

>'"■  '■  otist  Guard  -  currently  in  the  process  ot  separating  its  administrative  and  opera - 
n-nal  systems  bv  establishing  at,  Operational  Computer  Center  This  new  Center  wall  give  res¬ 
cue  coordinators  dtn-c:  access  to  C  ASP  through  on-line  terminals  and  will  improve  C  ASP  s 
a v ,i dahilit \  and  roiiabdiiv  Interactive  program  control  will  make  the  modules  easier  *o  use 

The  application  of  CASP  in  operational  situations  has  been  quite  successful  in  spile  o! 
Munitiv.iu'  i'i>,  umher.incc  associated  with  computet  and  communications  services 
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Continued  oceanographic  research  programs  will  expand  CASP’s  applicability  to  important 
in-shore  regions.  Implementation  of  the  new  multi-unit  allocation  algorithm  [11  is  expected  to 
simplify  the  search  area  assignment  problem.  These  additional  capabilities  coupled  with 
improved  computer  access  and  reliability  should  make  CASP  an  even  more  valuable  planning 
tool  in  the  future. 
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CONCENTRATED  FIRING  IN  MANY-VERSUS-MANY  DUELS 
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ABSTRACT 

A  simple  stochastic-duel  model,  based  on  alternate  tiring,  is  proposed  This 
model  is  shown  to  be  asymptotically  equivalent,  for  small  hit  probabilities,  to 
other  known  models,  such  as  simple  and  square  duels.  Alternate  tiring  intro¬ 
duces  an  interaction  between  opponents  and  allows  one  to  consider  multiple 
duels  Conditions  under  which  concentrated  tiring  is  belter  or  worse  than 
parallel  tiring  are  found  by  calculation  and  sometimes  by  simulation  The  only 
parameters  considered  are  the  combat  group  sizes  (all  units  within  a  group  are 
assumed  identical),  the  hit  probabilities  and  the  number  of  hits  necessary  to 
destroy  an  opposing  unit 


1.  INTRODUCTION 

Two  extremes  for  the  modeling  combat  attrition  are  given  by  the  so-called  Lanchester 
theory  of  combat,  which  treats  combat  attrition  at  a  macroscopic  level,  and  by  the  theory  of  sto¬ 
chastic  duels,  which  treats  combat  attrition  at  a  microscopic  level  and  considers  individual  firers. 
target  acquisition,  the  firing  of  each  and  every  round,  etc.  (see  Ancker  U.  pp.  388-389]  for 
further  details).  Actual  combat  operations  are,  of  course,  much  more  complex  than  their 
representation  by  such  relatively  simple  attrition  models  and  may  also  be  investigated  by  means 
of  much  more  detailed  Monte  Carlo  combat  simulations.  Unfortunately,  such  detailed  Monte 
Carlo  simulations  usually  fail  to  provide  any  direct  insights  into  the  dynamics  of  combat  wiihoui 
a  prohibitive  amount  of  computational  effort.  In  the  paper  at  hand,  we  will  consider  a  relatively 
simple  stochastic-duel  model  to  develop  some  important  insights  into  a  persisting  issue  of  mili¬ 
tary  tactics  (namely,  what  are  the  conditions  under  which  concentration  of  fire  is  ’'beneficial"). 

In  his  now  classic  1914  paper,  F.W.  Lanchester  (10)  (see  also  111])  used  a  simple  deter¬ 
ministic  differential-equation  model  to  quantitatively  justify  the  principle  of  concentration,  i.e.. 
a  commander  should  always  concentrate  as  many  men  and  means  of  battle  at  the  decisive  point 
From  his  simple  macroscopic  model,  Lanchester  concluded  that  the  "advantage  shown  to  accrue 
from  fire  concentration  as  exemplified  by  the  n  square  law  is  overwhelming."  However,  this 
conclusion  depends  in  an  essential  way  on  the  macroscopic  differential-equation  attrition  model 
used  by  Lanchester  [10],  [11]  (see  Taylor  [14]  for  further  discussion)  and  need  not  hold  for 
microscopic  stochastic-duel  models  of  combat  attrition.  In  fact,  this  paper  shows  that  for  such 
microscopic  duel  models  it  is  not  always " best"  to  concentrate  fire. 

Subsequently,  many  investigators  have  commented  on  the  benefits  to  be  gained  from  con¬ 
centrating  fire.  For  example,  in  his  determination  of  the  probability  of  winning  for  a  stochastic 
analogue  of  Lanchester’s  original  model.  Brown  [6]  stressed  the  fact  that  the  model  applied  to 
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cases  of  concentrated  firing  by  both  sides.  Other  investigators  of  deterministic  I.anchester-npe 
models  from  the  macroscopic  combat-analysis  point  of  view  have  also  stressed  this  point  <e  g 
see  Dolansky  [7],  Taylor  [131,  and  Taylor  and  Parry  [15]).  Recently,  Taylor  [14]  has  examined 
the  decision  to  initially  commit  forces  in  combat  between  two  homogeneous  forces  modeled  b> 
very  general  deterministic  Lanchester-type  equations.  He  showed  that  it  is  not  always  "best”  to 
commit  as  much  as  possible  to  battle  initially  but  that  the  optimal  decision  for  the  initial  com¬ 
mitment  of  forces  depends  on  a  number  of  factors,  the  key  of  which  is  how  the  trading  of 
casualties  depends  on  the  victor’s  force  level  and  time. 

The  first  reference  to  problems  of  strategy  in  multiple  duels  is  found  in  Ancker  and  Willi¬ 
ams  [21,  who  study  the  case  of  a  square  duel  (2  vs  2)  and  arrive  at  the  right  conclusion  that 
parallel  firing  is  better  than  concentrated  firing.  This  is  a  natural  conclusion  since  only  one  hit 
is  necessary  to  achieve  destruction,  and  in  concentrated  firing  there  is  a  certain  amount  of 
over-killing.  In  1967,  Ancker  [11  makes  suggestions  for  future  research  concerning  mutliple 
duels  and  states  explicitly  that  the  difficulties  lie  in  the  strong  interaction  between  the  contes¬ 
tants.  The  possibility  of  needing  more  than  one  hit  to  achieve  destruction  in  the  simple  duel 
situation  was  introduced  by  Bhashyam  [41  in  1970. 

The  purpose  of  this  paper  is  to  combine  some  of  the  above  mentioned  concepts,  in  order 
to  gain  insight  concerning  a  problem  of  strategy  in  multiple  duels— should  one  concentrate 
one's  fire  or  not? 

2.  ASSUMPTIONS  AND  NOTATION 

Let  us  consider  two  forces  A  and  B  that  meet  each  other  in  combat.  A  consists  of  M  units 
and  B  of  N  units. 

The  following  assumptions  arc  made: 

1.  Firing  is  alternating,  volley  after  volley,  i.e.,  A  fires  all  weapons  simultaneously,  then  B 
and  so  on  until  all  units  of  a  force  are  destroyed.  This  is  contrary  to  the  usual  assumption  of 
either  simultaneous  firing  or  random  firing  within  some  lime  intervals  as  found  in  Robertson 
[12),  Williams  [17],  llelmbold  [8],  [9],  Thompson  [16],  Ancker  [3],  It  is  felt,  and  will  be 
shown  in  a  few  cases,  that  for  relatively  small  probabilities  of  hitting,  this  approach  gives  results 
comparable  to  Ancker  and  Williams  [21 .  We  will  denote  by  the  probability  of  /  winning  if  / 
shoots  first  i,  j  =  A,B  The  unconditional  probability  of  winning  will  be  denoted  by  1 4  or  I  w 

2.  Hit  probabilities  are  constant  and  are  respectively  pA  and  pH.  with  q,  =  1  —/»,./«  A.B. 

3.  taeh  unit  of  force  A  requires  K  A  hits  to  be  destroyed.  Same  for  B  and  KH. 

4.  The  supply  of  ammunition  is  unlimited 

5.  There  is  no  time  limit  to  score  a  hit. 

6.  In  a  multiple  duel  (more  than  I  vs  1 )  the  units  of  A  concentrate  their  fire  on  a  single 
unit  of  B  while  the  units  of  B  each  fire  at  a  different  unit  of  A.  or  spread  their  fire  over  all 
available  units  of  A,  this  last  case  occurs  when  M  <  ,V  B  has  to  allow  an  amount  of  concen¬ 
tration  in  order  not  to  lose  some  shots.  Concentration  will  be  kept  at  a  minimum  to  preserve 
as  much  parallelism  as  possible.  For  example  if  M  =  3  and  .V  =  7  the  pattern  of  fire  for  B  has 
to  be 
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7.  The  most  general  notation,  for  example,  V 4llj(M,N,K 4,KH,p4  pH)  will  be  avoided  it 
possible  and  replaced  by  an  appropriate  simpler  form. 


Before  proceeding,  a  general  remark  ought  to  be  made:  most  of  the  difficulties  come  from 
the  asymmetry  in  the  situation  and  from  the  interaction  between  the  opponents.  The  same 
model  has  to  express  concentration,  dispersion  and  partial  concentration  of  fire.  Moreover,  the 
probability  of  winning  depends  upon  the  whole  past  history  of  the  duel. 

3.  MULTIPLE  DUEL.  ONE  HIT  SUFFICIENT  TO  DESTROY 

Let  Ka  =  Kg  =  1  and  let  EU.j)  be  the  state  of  group  A  with  /  units,  and  of  group  B  with 
J  units. 


If  A  fires  first,  the  next  state  is 

EU.j )  with  probability  q'4  and 
EU.j  -  1)  with  probability  1  -  q'4. 


When  B  fires,  let  us  first  consider  the  case  when  j  <  i. 

EU.j )  becomes  EU  -  k.j).  k  —  0 . j  with  probability 

and 


Then, 


E ( i.j  -  1)  becomes  EU  —  k.  j  -  I),  k  —  0 . j  —  1  with  probability 


k 


If  on  the  other  hand  j  ^  /  some  regrouping  has  to  be  done. 


Let  j  -  ai  +  b  with  b  <  r.  a.  b  €  /*.  The  regrouping  which  spreads  the  fire  the  most  is 
given  by 


a  shots  are  fired  with  a  probability  of  success 
I  -  (1  -  pB)a  —  1  -  A0  at  each  of  i  —  b  targets 


a  +  I  shots  are  fired  with  a  probability  of  success 
1  -  (1  -  pB)a  +  1  -  1  -  A,  at  each  of  b  targets. 


Define  r  -  min(i,y  ).  Then  both  cases  j  <  /  and  j  ^  /  are  identical  if  one  defines  the  probabil¬ 
ity  of  transition  from  state  EU.j)  to  slate  EU  -  k,  j)  when  B  fires  as 


(3.1)  9U,  j.  k,  pB) 


I  ,r  -h 

*,-0.1.  h 


b  -  k i  *0  r  -  b  -  A„ 

0  -  *o>  A 
A  0 


In  the  case  j  <  i,  a  -  0,  kn  —  0  and  k{  -  k. 


684 


\  /IM.I  K 


r 


It  follows  that  if  A  starts  and  B  returns  fire  once,  the  intial  state  £(/',  j)  can  become 
£(/,  j)  with  probability  qA  0  (/,  j,  0,  pB)  =  qA  qfa 

E(i  —  A,  j )  with  probability  q\  0  (/.  j.  k.  pH).  k  =  I . r 

E(i  —  k,j  —  1)  with  probability  (I  —  qA)  0  (/.  j  —  1,  A,  pH>.  k  =  0,  I . r‘ 

where  r'  =  min  (/,  j  —  1). 

If  B  starts,  the  initial  state  £(/,  j)  can  become 

£(/,  j)  with  probability  qA  0  (/,  j,  0 ,/>«)  =  qA  qjj 

E{i  —  k,  j)  with  probability  qAk  0( i.j.k.Pn > ■  A  =  1 . r 

E(i  —  A.  j  —  1)  with  probability  (1  —  qAk)  0  (/,  j.  A,  pH).  A  =  0,  I . r" 

where  r"=  mini/  —  1,  j). 

Let  Vg\A(M,  N)  denote  the  probability  that  group  B  wins  with  initial  slate  £ < A/,  AO  and 
A  starts  firing.  Then 

(3.2)  Vb\aM.  AO  =  qy  q%  Vhu(M.  N) 

+  Qa  £  V  (W.  A/.  k.  pH)  VHU( M  -  k,  AO 

A-l 

+  (1  -  </.f)  £  0(M.N-  I,  A.  />*)  ^(A/  -  A.  A/  -  I). 

*-o 

This  corresponds  to  a  decomposition  into  all  the  mutually  exclusive  and  exhaustive  ways  for  B 
to  win  if  A  fires  once  and  then  B  returns  fire. 

In  a  similar  way 

(3  3)  AO  -  q*fq%  VB]H  (M,  AO 

+  £  qV  k  0  (M.  N.  A,  pu)  Lfli/jIA/  -  A,  AO 

A-l 

+  £  (1  "  «"“*>  N'  k’PH)  ~  k-  V  ~  ')• 

Since  we  have 

VH  4(M.  0)  =  I’bihIM.  0)  =  0  all  M 
and 

T»  ,(0,  AO  -  T„|S(0.  /V)  -  1  all  N 

we  can  calculate  in  succession  all  required  probabilities  For  example,  since  0  11,1,1.  p„)  -  pH, 
one  finds  Fau(l,  1)  -  q4pB/ (1  -  qAqB).  Using  Ts*(l,l)  and  0(1, 1.0.  p„)  -  qH, 
00,2.1.  pK)  -  I  -  q, i  one  finds  F'W)4<1. 2) 


Fxplicitly,  one  gets,  by  assuming  that  A  starts  half  the  time. 
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One  can  also  obtain  for  qA  =  qB  =  q 

v  (2  2)  =  1  +  4<?  4-  V  +  lq3  +  4 q*  +  iqs  +  qb 
B  ’  2(\  +  q)2  (l  +  q2)  (\  +  q  +  q2) 


A  comparison  with  the  triangular  duel  and  the  first  square  duel  [2]  for  p  —  0,  q  —  I 

gives 

y  (2  i)  _  1  EiaSl  ±  _  £ll±  q2)  _ ^./6 

B  2(1-  <j2)(l  -  q 3)  2(1  +  fl)(l  +  <?  +  <72)  v-i 

and  ^(2.2) - - 1/2  which  are  the  same  limits  as  the  one  obtained  from  Equation  29  and  37 

Q  l 

in  [2], 


Table  1  gives  some  results  for  VB(M.  N.  p4,  pa). 

TABLE  1  -  (x  104) 


M 

N 

Pa  0.3 

Ph  0.3 

0.3 

0.5 

0.5 

0.3 

0.5 

0.5 

0.7 

0.5 

0.5 

0.7 

0.7 

0.7 

1 

1 

5000 

6538 

3462 

5000 

3824 

6176 

5000 

2 

2 

5166 

7307 

3100 

5317 

3850 

6873 

5447 

3 

3 

5678 

8227 

3405 

6418 

5081 

8343 

7386 

3 

5 

9634 

9982 

8869 

9913 

9805 

9998 

9994 

5 

3 

1292 

3806 

0368 

1780 

0997 

3907 

2832 

5 

5 

7258 

9614 

5118 

8940 

8359 

9920 

9848 

5 

7 

9831 

9999 

9422 

9994 

9986 

10000 

10000 

7 

5 

3418 

7843 

1626 

6060 

5075 

9090 

8629 

7 

7 

8850 

9978 

7538 

9919 

9853 

10000 

10000 

10 

10 

9900 

10000 

9708 

10000 

10000 

10000 

10000 

It  should  be  noted  that  if  pA  =  pB  -  p  and  M  -  N  then  VB  ^  y  and  increases  with  p  or 
with  M  We  conclude:  Par,illel firing  is  better. 

No  simple  relationship  exists  in  the  case  pA  *  pH  Neither  MpA  vs  NpH ,  nor  M2pt  vs 
N2pB  are  sufficient  to  decide  if  VB  >  y 

4.  SIMPLE  DUEL.  K  HITS  NECESSARY  TO  DESTROY 

Let  M  -  N  -  1  and  let  ^B\A^KA.KB)  denote  the  probability  that  B  wins  the  simple  duel 
if  A  starts  firing  and  K4  hits  are  necessary  to  destroy  A  and  KB  for  B 

It  is  evident  that 

Vbu{Ka.Kb)  -  PA  VB\B(h iA.  Kb  -  1)  +  qA  Vb\b(Ka,  Kb) 
and 


yBlB(KA,  kb)-  PBv„\4u<4 


1.  Kb)  +  Qb  KB). 
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This  gives 

(4.1)  (1  ~  Qa  Qb )  KbU (Ka,  Kg)  —  pA  pg  Vb\a(Ka  ~  Kg  —  1) 

—  Pa  Qb  Vb\a(Ka,  Kg  —  1)  —  qA  Pg  Ybia(Ka  —  1.  Kg)  —  0. 


In  order  to  solve  this  difference  equation,  following  Boole  [51,  let  us  define 
x-KA.  y-Kg 
ux.y  “  vb\a(x  -  1.  y  -  1) 

Dxu  -  11,+, j,  and  Dyu  -  uxy+i. 

Substituting  these  into  Equation  (4.1)  we  get 

1(1  -  Qa  Qb)Dx  Dy  -  pApB-  pA  Qg  Dx  -  qA  pB  Dy)u  -  0. 

Let  Dv  =  a. 

((1  “  Qa  Qb^o  ~  Pa  Qb)Dxu  -  pB(aqA  +  pA)u 

which  gives 

«  “  pUpa  +  Qa  Dy)x  Id  -  qA  qB)Dy  -  pA  qBVx  9  (y) 
where  9(y)  is  arbitrary.  Then, 

«  -  pAt  Pa  Qa‘  DxA  0  -Qa  Qb rxDy~x  1  "  JZ~^D>7' 


Since  Dv  x9(y)  —  9(y  -  x)  and 

[,- 


PaQb  n_, 

-x 

oo 

_  v 

x  +  j  -  1 

Pa  Qb 

1  —  QaQb 

“  z. 

y-o 

j 

1  -  QaQb 

D:> 


we  get 


Pb 


1  ~  QaQb 
Taking  into  account  that 

W 1. 1) 


II 

i-0  y— 0 


A  I x  +  j  -  1 

j 


PajQa  ‘Qb  (1  -  Qa  Qb )  j  9(y  - 


Pb  Qa 


1  ~  QaQb 

a  good  choice  for  9(t)  is 

9(i)  -  1  if  t  >  0 
—  0  if  /  0. 


Defining  r  -  min(/f, ,  Kg  -  1)  the  solution  becomes 

KA  +  j  -  l|  Ka  Ka  -  i 


r  1  ~ ^  I  Ka 

(4.2)  yB\A(KA,Kg)~Z  I 

i—n  /-o  ' 

with 


\PajPb  q<  ?Ad- 


VB\aIKa.  0)-0and  VBl4(0.  Kg)  -  1. 


•  O')- 


i-j). 


Qa  <?«> 
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One  can  verify  by  substitution  that  this  is  a  solution. 

One  can  evaluate  the  other  probabilities  of  winning  by 

Ka\ b^Ka,  Kb.  pA,  pB )  =  Vb\a(Kb-  Ka,  Pb,  pA). 
^b\b^-Ka.  Kb-  Pa-  Pb)  =  1  -  ^a\b^Ka,  Kb.  Pa-  Pb )• 

Va\aWa-  Kb.  Pa-Pb )  =  I  -  VB\a (*a-  Kb-  Pa-Pb>- 


Table  2  gives  some  results  for  Ybia(Ka.  KB.  pA,  pB)  and  VH^H(KA.  KH ,  p4  pB) 

TABLE  2  -  (x  I04) 


Ka 

Kh 

Pa  =*  -3 
V*\A 

Pb  =  -5 
I'filfl 

mm 

Pa  =  -5, 

^BlA 

Pb  =  7 

1 

1 

5385 

7692 

3333 

6667 

4118 

8235 

5 

3 

4257 

5010 

1139 

1728 

2576 

3579 

5 

5 

8201 

8630 

4512 

5488 

7414 

8381 

7 

5 

5955 

6541 

1674 

2266 

4159 

5278 

H 

mm 

8695 

8981 

4599 

5401 

7981 

8669 

19 

■El 

9160 

9330 

4671 

5329 

8545 

9002 

This  table  indicates  that  -  1/2  if  K4  «  K„  and  p4  «  pB  =  1/2,  VH  increases  towards  1 
if  K4  -  KB  and  pB  >  p4  and  I  VBt4  -  VB\B\  decreases  if  K4  and  KH  increase. 

An  interesting  comparison  is  to  be  made  with  the  results  given  by  Bhashyam  14]  Under 
an  assumption  of  an  exponential  distribution  for  interfiring  times  he  finds  that  the  probability  of 
B  winning  is,  using  our  notation, 

P(B)  -  1  -  /  „4  (K„,  K4) 

f>4*rg 

where  lx  is  the  incomplete  Beta  function  The  correspondance  in  the  notations  being  Kp  for  p4, 
A  *p*  for  pB,  R  for  KB  and  R‘  for  KA 

Table  3  shows  at  what  rate  a  model  with  alternate  firing  converges  towards  Bhashyam's 
model. 


Alternate  firing  gives  a  good  approximation  if  p  is  small.  In  fact,  consider  K4  and  Ks 
fixed  and  p4  —  c  pB  with  pB  —  0. 


One  can  show  that 

lim  VB\4  - 


_ ! _  y 

(I  +  e)K<  % 
and  this  limit  from  a  well  known  theorem  is 
1-/,  (Kb.K4). 

7*i 


Ka  +2-1 

j 


1  +  c 


-  lim  yB\B 


1 
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TABLE  3  —  Rate  of  Convergence  of  VB  to  P(B) 


Pa 

Pb 

Ka 

kb 

Yb 

P(B) 

0.4 

0.2 

5 

5 

0.1054 

0.2 

0.1 

0.1265 

0.02 

0.01 

0.1431 

0.002 

0.001 

0.1447 

0.1449 

0.1 

0.2 

5 

2 

0.3391 

0.01 

0.02 

0.3501 

0.001 

0.002 

0.3511 

0.3512 

0  1 

0.2 

10 

10 

0.9491 

0.01 

0.02 

0.9366 

0.9352 

5.  SQUARE  DUEL.  2  HITS  NECESSARY  TO  DESTROY 

Lei  M  —  N  —  2  and  K4  —  KB  —  2.  One  can  represent  the  state  of  the  two  forces  by  (/,, 
<2.  j\.  ji )  with  1 1 .  /j,  j\,  ji  —  0. 1,2,  representing  the  number  of  hits  necessary  to  destroy.  For 
example,  (1,  1;  0,  2)  means  that  A  has  2  units  that  can  be  destroyed  by  one  hit  each  and  B  has 
one  unit  that  has  been  destroyed  by  2  hits  and  one  unit  untouched. 

All  attempts  to  arrive  at  one  or  two  difference  equations  have  been  in  vain  Two 
equivalent  approaches  have  been  used.  In  the  first,  taking  pA  —  pB  -  1/2,  and  defining  A ,  as 
the  matrix  of  the  transitional  probabilities  corresponding  to  the  case  when  A  fires  first,  and  B 
the  corresponding  matrix  when  B  fires  first  one  obtains: 

V  4  by  summing  all  the  probabilities  for  the  events  < j\  0,  0)  in  lim  (AB)n  and  VB  by 

n  -  >  «> 

summing  all  the  probabilities  for  the  events  (0,  0,  /,  j)  in  lim  (BA)n. 

n  -  >oo 

The  matrices  are  29  x  29.  The  possible  states  of  A  are  such  that  <(  ^  /2.  The  possible 
states  of  B  are  such  that  j\  <  j2  and  exclude  j \  -  j2  —  l  since  A  concentrates  its  fire  until  des¬ 
truction  is  achieved. 

Assuming  the  ordering  it  ^  i2,  two  variations  are  possible  In  Case  1,  when  the  state  is 
(2,1;  0,  j)  with  j  —  1  or  2  and  B  fires,  B  chooses  at  random  among  the  two  units  of  A  In  Case 
2.  B  fires  on  the  second  unit  of  A ,  which  can  be  destroyed  by  one  shot.  We  find 

In  Case  1  Y4  “  0.5586 
and  in  Case  2  P4  -  0  5396 
In  both  cases  concentrated  firing  is  better. 

The  other  approach  consists  in  writing  down  all  the  equations  that  define  the  battle  For 
example, 

F4U(2.2.1.2>  -  (1  -  q})  F4lfl(2, 2,0,2)  +  q}  VA  lfl  (2.  2,1 , 2). 

The  difference  between  Case  1  and  2  is  seen  by  considering 

F4I8(2.1;0.1)  -  0.5  p„  VVA{\.  1.0. 1)  +  0  5 p„  VA  4(2,0;0,1) 

+  q„  V4i4(2,  1;0, 1 ) 
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or 

^aiiO.l )  =  pB  ^u(2,0;0 ,\)  +  qB  M (2,  I ;0.  I). 

A  third  variation  is  possible  in  which  no  ordering  is  assumed  for  the  iKs.  Only  the  states  with 
ii  -  0  are  eliminated.  In  this  case,  B  fires  always  upon  the  last  unit  of  A  but  2  states  are  con¬ 
sidered 

^l*(2.1;0.1>  =  pB  K4u(2.0;0.I)  +  qB  VaU( 2, 1,0,1) 

and 

Ka ,«< !■  2;0,  \  )  =  pB  VAU(\.\,0.\)  +  qB  VA\A  (2,  1;0,  1). 

In  this  case  VA  =  0.5553  for  pA  =*  pB  =  0.5. 

The  total  system  consists  of  35  pairs  of  equations  and  is  solved  by  iterations. 

Table  4  gives  some  results  for  the  square  duel  in  this  last  case.  As  in  the  two  preceeding 
cases,  concentrated  firing  is  better. 

An  extension  of  this  last  case  is  considered  in  the  next  section. 

6.  MULTIPLE  FA  R  DUELS.  K  HITS  NECESSARY  TO  DESTROY 

Let  us  rest  ct  ourselves  to  the  case  of  a  fair  duel,  i.e.,  one  such  that  M  *  N  =  n. 
Pa  -  Ph  =  P  and  KA=*  KB~  A 

All  nondestroyed  units  of  A  concentrate  their  fire  on  a  single  unit  of  B ,  volley  after  volley 
until  destruction  is  achieved.  For  the  next  volley  they  concentrate  their  fire  on  the  next  undes¬ 
troyed  unit  of  B 

There  are  wA  +  1  possible  states  for  B 

A.  A . A 

A  -  I  A' . A 

1,  A,  ...  A 

0,  A . A 


0,  0 . 0. 

On  the  other  hand  B  spreads  its  fire  over  all  units  of  A  and  ail  states  are  possible,  eliminating 
only  the  destroyed  units. 

Since  there  are  A"  1  different  states  with  j  zeros  the  number  of  possible  states  for  A  is 
(A"*1  -  1 )/( A  -  1). 

This  means  that  in  order  to  find  VA(K . A,  A . A)  we  will  have  to  solve  a  linear  sys¬ 

tem  consisting  of  (nK  +  II  (A"*1  —  I )/ (A  —  I )  pairs  of  equations  of  the  form 

rAlA  ( state )  -  linear  combination  of  VA\B  (outcome  of  A  firing) 

F'a  «  (state)  -  linear  combination  of  VA \A  (outcome  of  B  firing) 


i 
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TABLE  4  —  (x  104)  Square  Duel 


Pa  =  Pb  =  P 

Va\a 

*a\b 

Va 

Number 

of 

Iterations 

0.999 

9980 

40 

5010 

3 

0.99 

9809 

382 

5096 

3 

0.95 

9193 

1599 

5396 

4 

0.9 

8666 

2573 

5620 

5 

0.7 

7573 

3850 

5712 

8 

0.5 

6781 

4324 

5553 

13 

0.3 

6117 

4786 

5452 

25 

0.1 

5598 

5198 

5398 

80 

0.05 

5487 

5292 

5389 

157 

0.025 

5434 

5337 

5385 

303 

0.02 

5423 

5346 

5385 

373 

0.01 

5402 

5364 

5383 

844 

Unfortunately,  the  number  of  possible  states  increases  very  rapidly.  A  few  values  are 

given: 

Number  of  States 

K  —  2  n  =  2  35 

3  105 

4  279 

5  693 

6  1651 

K  =  3  n  -  2  91 

3  400 

4  1573. 

This,  however,  is  much  better  than  (A'  +  1)’",  which  is  the  number  of  possible  states  without 
any  restrictions. 

S’nce  writing  down  the  necessary  equations  is  an  impossible  task,  a  computer  program  was 
written  to  build  the  equations  and  solve  them  by  iteration.  The  main  steps  are: 

( 1 )  define  the  necessary  states, 

(2)  define  4  ”  0  for  all  states 

f '4  lfl  “  0  for  all  states  if  B  is  not  destroyed 
H  =  I  if  B  is  destroyed. 

These  will  be  the  initial  conditions. 

(3)  For  each  state  determine  the  number  of  effective  units  M4  and  \'H.  If  A  fires,  the 
number  of  targets  is  T  =  I  and  the  degree  of  concentration  is  c  «  M4.  If  B  tires,  the  number 
of  targets  is  7=  minf/Vf*.  NH).  If  M4  ^  SH,  the  degree  of  concentration  is  c  =  1  and  if 
>  M4,  then  A#  -  a  M4  +  b  and  <  |  —  a  for  T\  *  M4  -  b  units  and  c?  -  a  +  1  for  T:  -  b 
units 
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(4)  Let  Qc(i,  j)  denote  the  probability  for  a  unit  to  go  from  state  K  =  /  to  state  K  =  j  if 
submitted  to  fire  of  concentration  c.  Then  the  matrix  Q 2,  for  example,  has  the  form 


0  12  K 


In  general,  for  1  =  1,  K  and  j  =  0, 1 . K 

J  P~‘  QC  I+I  for  j  *  0 

QCU.  j)  =  k 

1  ~  £  (?<■('»  J )  for  J  “  O' 

/-! 

All  required  matrices  are  constructed. 

5)  For  each  state  the  equation  giving  VA\A  is  constructed. 

Let  i  denote  the  state  of  the  target  unit. 

Let  j  denote  the  states  of  this  unit  after  A  has  fired,  the  rest  of  B  being  unaffected 

Then, 

r«i 4UU)  -  I  0Mt(U)  V^b(A  J) 

/ 

the  corresponding  equation  for  VA\B  is  of  the  general  form 

■W'V‘2 . *>;«)-  I  ri  Q^ieJe)  VA  la  Ol, . j^B). 

Jv  J 

For  example, 

K.i.U, 2.0.0, 0;1. 2.2.2.21  -  X  02(U|)0j(2j,)  VaU  0,  j2.0.0.0;l.  2. 2,  2.  2). 

/  j-11 1  ? 

6)  When  all  possible  states  are  gone  through,  the  last  calculated  value  is 

Vvb(K.K . K,  K . K). 

It  is  compared,  usually  within  10  h,  to  the  previously  calculated  value  and  the  process  is 
iterated  until  convergence  is  achieved. 

Table  5  gives  results  for  several  values  of  M  and  K.  The  dimension  of  the  linear  system 
is  twice  the  number  of  states.  The  probability  of  a  hit  is  taken  as  p  -  0  5.  Time  is  given  for 
some  cases  The  computer  used  was  a  CDC6400 

The  value  p  -  0  5  was  chosen  because  time  increases  very  fast  if  p  decreases,  as  is  seen 
from  Table  4 


A  ZINGER 


TABLE  5  —  (VA  x  104),  Multiple  Fair  duel.  Exact  Results 


M~N  \  K 


Number 

of 

Equations 

Number 

of 

Iterations 

Va 

Time 
(in  secon 

70 

13 

5553 

182 

15 

5988 

378 

17 

6364 

682 

19 

6661 

210 

13 

5537 

800 

15 

6211 

2210 

17 

6822 

4992 

19 

7289 

1700 

558 

13 

5152 

3146 

15 

6132 

11594 

17 

6872 

6141 

1386 

12 

4429 

11648 

15 

5931 

8960 

Since  exact  calculations  of  VA  become  too  time  consuming,  some  results  were  obtained  by 
simulation.  Table  6  gives  some  results.  The  number  of  trials  was  2000  for  p  ^  0  5  and  6000 
for  p  —  0.5,  A  started  the  duel  in  half  the  cases. 

TABLE  6  -  (VA  x  103), 

Multiple  Fair  Duel.  Simulation  Results 

P  0.1  '  0.3  |  0.5  |  0.7  |  0.9  | 

M  I  K 


0.1 

0.3 

0.5 

0.7 

550 

542 

561 

569 

574 

562 

522 

485 

583 

490 

351 

148 

580 

382 

120 

2 

562 

249 

11 

0 

600 

600 

611 

622 

652 

632 

599 

628 

672 

606 

564 

494 

705 

554 

444 

181 

725 

482 

233 

6 

586 

616 

642 

691 

715 

694 

685 

666 

774 

700 

653 

672 

796 

684 

608 

548 

797 

643 

524 

204 

630 

646 

674 

708 

754 

753 

760 

748 

812 

786 

725 

665 

838 

777 

668 

698 

878 

740 

639 

594 
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We  note  that  for  large  values  of  p  the  behaviour  of  VA  is  erratic.  This  is  due  to  the  deter¬ 
ministic  issue  of  a  battle  for  p  =  1  as  a  consequence  of  alternative  firing.  For  example,  if 
M  =  6,  k  =  2  and  A  starts  firing,  the  sequence  of  states  is  B.  022222,  A:  211111,  B  002222,  A: 
210000,  B :  000222,  B  wins. 

Two  independent  estimates  of  the  error  can  be  made;  one  by  comparing  the  results  of  the 
simulation  with  the  calculated  values  in  Table  5  for  p  =  0.5,  M  =  jV  =  2  or  4  and  A  =  2,3,4, 
giving  s  =  0.0093,  the  other  estimate  is  given  by  assuming  a  binomial  distribution  with  6000 
trials  giving  s  =  0.0065.  To  be  on  the  safe  side  one  can  conclude  that  concentrated  firing  is 
better  if  the  simulation  gives  VA  ^  0.519  and  parallel  firing  is  better  if  the  simulation  gives 
V A  <  0.481.  This  does  not  take  into  account  the  bias  introduced  by  alternate  firing  for  "large" 
values  of  p.  Since  the  sign  of  the  bias  is  evident,  one  can  adjust  one’s  conclusions,  for  example 
for  M  =  10,  A  =  4  and  p  =  0.5  the  observed  value  0.524  is  pulled  down  and  almost  certainly  A 
wins  more  often  than  B.  On  the  other  hand  for  M  =  8,  A"  =  3  and  p  =  0.5  the  value  0.444  is 
certainly  pulled  down  and  one  can  hardly  conclude  that  B  wins  more  often. 

Table  7  summarizes  all  the  results  obtained. 


TABLE  7  —  Better  Strategy  of  Firing 


Concentrated 

Parallel 

Border  cases 

p  =  0.1 

A  >  2 

A  =  1 

p  =  0.3 

A  =  3  2<A/<4 

A  =  2  M  >  7 

A  =  2  M  =  5  or  6 

A  =  3  2<A/<8 

A  =  3  M  -  9  or  10 

A  =  4, 5  2<A/<at  least  10 

p  =  0.5 

A  -  2  2  <  M  <  3 

A  =  2  M  >  5 

A  =  2  M  -  4 

A  =  3  2s$M<6 

A  =  3  /W  Js  7 

A  -  4  2  <  M  «£  10 

A  -  5  2  <  M  <  at  least  10 

One  can  conclude  that  concentrated  firing  is  better  if  the  combination  of  group  size  and 
hit  probability  does  not  produce  a  high  degree  of  overkilling.  For  A  >  2  a  rough  rule  could  be 
concentrate  firing  if  pM  <  K  (the  exception  is  p  =  0.5,  A  =  4  and  M  =  9  or  10). 

Up  to  this  time  we  have  compared  two  strategies:  parallel  firing  and  concentrated  firing. 
In  the  next  section  we  will  attempt  to  define  the  concept  of  partial  concentration. 

7.  MULTIPLE  FAIR  DUELS.  PARTIAL  CONCENTRATION  OF  FIRE 

Let  M  =  N  =  n,  pA  =  pB  —  p  and  KA  =  KB  =  A.  Let  cx  be  the  maximal  number  of  non- 
destroyed  units  of  A  that  are  allowed  to  concentrate  their  fire  on  a  single  unit  of  B ,  volley  after 
volley  until  destruction  is  achieved. 

If  c*  =  1,  A  uses  parallel  firing  in  the  same  manner  as  B  If  cx  -  n,  A  uses  concentrated 

firing 


Under  partial  concentration  the  number  of  targets  for  A  is  given  by  the  integer  function 


Ta 


n  +  cx  -  1 
cx 
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A  /INCil  K 


and  the  number  of  possible  states  for  B  is 

T  —  1 

(n  -  Ta  +  1  )KTa  +  - 

A  —  1 

which  reduces  to  nK  +  1  for  TA  =  1  and  (Kn+i 
linear  equations  to  be  solved  becomes 


2 


a:^1  -  i 

K  -  1 


(n  —  TA  +  1)  KT*  4- 


-  \)/(K  -  1)  for  Ta 


K  -  1 


n.  The  number  of 


The  value  cx  =  1  (TA  =  n)  was  used  to  determine  the  precision  of  the  obtained  results, 
since  VA  —  0.5.  For  p  =  0.5  the  maximum  error  found  was  3  x  10-5  and  for  p  =  0.1  it  was 
1  x  10~4. 

Table  8  gives  some  calculated  results  for  p  =  0.5. 

TABLE  8  —  (VA  x  104) ,  Partial  Concentration 


A/=  N 

K 

cx 

Number 

of 

Equations 

Number 

of 

Iterations 

Va 

3 

2 

2 

330 

13 

5404 

3 

2 

1760 

15 

5950 

4 

2 

6290 

16 

6412 

4 

2 

2 

930 

12 

5639 

3 

930 

13 

5192 

3 

2 

7502 

14 

6304 

3 

7502 

15 

6127 

5 

2 

2 

3906 

12 

5541 

3 

2394 

12 

5311 

4 

2394 

12 

4523 

Comparing  the  results  of  Table  5  and  Table  8,  one  sees  that  partial  concentration  with 
cx  —  2  is  better  than  total  concentration  for  the  cases  M  =  4  and  k  =  2  or  3  and  any  partial 
concentration  is  better  for  the  case  M  =  5  and  k  =  2.  Further  investigations  are  needed. 

8.  SUMMARY 

The  proposed  model  is  an  idealization  of  combat  between  small  groups  of  individual 
identical  firers  and  is  very  far  from  the  very  complicated  process  of  real  combat.  However,  it 
has  provided,  through  the  use  of  alternate  firing  as  an  expression  for  the  interaction  between 
opponents,  some  important  insights  into  combat  dynamics  that  could  be  further  investigated 
with,  for  example,  a  high-resolution  Monte  Carlo  simulation.  It  has  been  shown  that  alternate 
firing  gives  the  same  results  for  small  hit  probabilities  as  some  previously  developed  models.  It 
has  also  been  shown  that  the  relationship  between  the  size,  the  hitting  capacity  and  the  resis¬ 
tance  of  the  opponents  is  a  complex  one  and  that  concentrated  firing  is  better  than  alternate 
firing  if  the  amount  of  over-killing  is  not  too  high.  Moreover,  some  evidence  suggests  that  par¬ 
tial  concentration  can  be  even  more  effective. 
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ABSTRACT 

During  basis  reinversion  of  either  a  product  form  or  elimination  form  linear 
programming  system,  it  may  become  necessary  to  swap  spike  columns  to  effect 
the  reinversion  and  maintain  the  desired  sparsity  characteristics  This  note 
shows  that  the  only  spikes  which  need  he  examined  when  an  interchange  is  re¬ 
paired  are  those  not  yet  processed  in  the  current  external  bump 


I.  INTRODUCTION 


An  important  component  of  a  large  scale  linear  programming  system  is  the  reinversion 
routine.  This  paper  addresses  an  important  ancillary  technique  for  implementing  a  reinversion 
routine  utilizing  the  pivot  agenda  algorithms  of  Hellerman  and  Rarick  15.6] .  Production  of  fac¬ 
tors  during  reinversion  typically  involves  a  left-to-right  pivoting  process.  Unfortunately,  during 
the  left-to-right  process,  a  proposed  pivot  element  of  a  spike  column  may  be  zero,  in  which 
case  columns  are  interchanged  in  an  attempt  to  obtain  a  pivotable  column  while  maintaining 
desired  sparsity  characteristics.  In  this  paper  we  show  that  the  only  columns  which  need  be 
considered  for  the  interchange  with  a  nonpivotable  spike  are  other  spikes  lying  to  the  right 
within  the  same  external  bump. 

II.  PRODUCT  FORM  OF  THE  INVERSE 


Let  B  be  any  m  x  m  nonsingular  matrix.  One  of  the  most  common  factorizations  for  B  1 
is  the  product  form  which  corresponds  to  the  method  for  solving  a  system  of  linear  equations 
known  as  Gauss-Jordan  reduction  (see  [3,  4)).  This  procedure  is  used  to  represent  B  1  (or  a 
row  and  column  permutation  of  B  l)  as  the  product  of  matrices  each  of  the  form 


/ 

z  = 

Z 

/ 

‘This  research  was  supported  In  part  by  the  Air  force  Office  of  Scientific  Research  under  Contract  Number  AT  OSH 
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where  z  is  an  m-component  column  vector,  and  j  is  called  the  pivot  row.  A  few  observations 
concerning  Z  are  obvious. 

PROPOSITION  1 :  Z  is  nonsingular  if  and  only  if  Zj  ^  0. 

PROPOSITION  2:  Let  p  be  any  m-component  vector  having  Pj  -  0.  Then  Zp  -  p. 

PROPOSITION  3:  Let  p  be  any  m-component  vector  having  Pj  ^  0,  and  let  eJ  denote 
the  vector  having  yth  component  1  and  all  other  components  zero. 


Let  zk 


~PjPj-  if  *  *  J\ 
Up),  if  k  -  j 


.  Then  Z/3  -  ej. 


Let  BO)  denote  the  rth  column  of  the  matrix  B.  Consider  the  following  algorithm. 

ALG  1:  Product  Form  Factorization 

0.  Initialization 


Interchange  columns  of  B,  if  necessary,  so  that  the  first  component  of  B(\ >  is  nonzero 
Set  /  —  1,  B  5(1),  and  go  to  3. 

1 .  Update  Column 
Setp—  ...  E'BU). 

2.  Swap  Columns  If  Pivot  Element  Equals  Zero 

If  p,  ^  0,  go  to  3;  otherwise,  there  is  some  column  B(j)  with  j  >  i  such  that  the  fth 
component  of  y  -  E'~l  . . .  E'B(j)  is  nonzero.  Interchange  Bij)  and  BU)  and  set  P  —  y 

3.  Obtain  New  Elementary  Matrix 


Set 


2k 


E' 


1  /Pt,  for  k  —  i 
-pk/Pi,  otherwise. 


/  th  row 


4.  Test  for  Termination 


If  /  -  m,  terminate;  otherwise,  i  —  i  +  1  and  go  to  1.  At  the  termination  of  ALG  1, 
Em  . . .  £'  is  a  row  permutation  of  B~'. 

In  the  following  two  propositions  we  show  that  if  in  Step  2,  p,  -  0,  then  the  proposed 
interchange  is  always  possible.  Consider  the  following. 

-  e1. 


PROPOSITION  4:  For  /  ^  f  EC..  E'BU) 
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PROOF:  By  the  construction  of  £'  and  Proposition  3,  .  £'£</)  =  e‘.  By  Proposition 

2,  EJ ...  r',  +  1e'  =  e‘.  So  EJ ...  E'B(i)  —  e‘.  Using  Proposition  4  we  may  now  show  the  fol¬ 
lowing: 

PROPOSITION  5:  For  2  <  /  $  m,  let  (i  =  E‘~ 1  ...  £'£(/).  If  fi,  =  0,  there  is  some 
j  >  /  such  that  l£,_l  ...  EXB(J)],  ^  0. 

PROOF:  Suppose  [£'  1  ...  E^B(J)],=  0  for  all  j  >  i.  By  the  construction  of 
£',  ...  £'"’,  in  ALG  1,  and  Proposition  I,  each  factor  is  nonsingular.  Since  B  is  nonsingular, 
£'  1  ...  E'B  is  nonsingular.  By  Proposition  4,  £'_l  ...  E]B(J)  =  eJ  for  I  ^  j  <  /  -  1 
Hence,  the  /th  row  of  £'-1  ...  £'£  is  all  zero,  a  contradiction. 

III.  BUMP  AND  SPIKE  STRUCTURE 

In  order  to  minimize  the  core  storage  required  to  represent  the  ETA  file,  i.e., 
£l . £"',  the  rows  and  columns  of  B  are  interchanged  in  an  attempt  to  place  B  in  lower  tri¬ 
angular  form.  If  this  can  be  accomplished,  then  the  m  nonidentity  columns  of  £' . £m, 

have  the  same  sparsity  structure  as  B  Consider  the  following  proposition: 

PROPOSITION  6:  If  the  first  j  -  1  components  of  B(j)  are  zero  for  j  >  2,  then 
E1'  ...  E'B(j)  =  B(j). 

PROOF.  This  follows  directly  from  successive  application  of  Proposition  2.  Therefore,  if 
B  is  lower  triangular,  the  factored  representation  of  B  1  may  be  stored  in  approximately  the 
same  amount  of  core  storage  as  B  itself.  In  practice  it  is  unneccessary  to  calculate  the  elements 
\/Bk  and  -fiJBk  in  Step  3  of  ALG  1.  It  suffices  to  store  k  and  the  elements  of  fi,.  It  may 
prove  advantageous  to  store  1/0;,  in  addition.  If  Proposition  6  applies  for  B(k).  then 
B  =  B(k)  and  the  only  additional  storage  required  is  for  the  index  k  (and  possibly  I//3*) 
Clearly,  this  results  in  substantial  core  storage  savings  compared  to  storing  B  1  explicitly. 

If  B  cannot  be  placed  in  lower  triangular  form,  then  it  is  placed  in  the  form: 


where  B]  and  are  lower  triangular  matrices  with  nonzeroes  on  their  diagonals.  We  assume 
that  if  B2  is  nonvacuous.  every  row  and  column  has  at  least  two  nonzero  entries,  so  that  no 
rearrangement  of  B 2  can  expand  the  size  of  B 1  or  B\  B 2  is  called  the  bump  set  non,  the  merit 
section  or  the  heart  section  We  further  require  the  heart  section  to  assume  the  following  form 


hr  _ 
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where  Gk's  are  either  vacuous  or  lower  triangular  with  nonzeroes  on  the  diagonal.  The  only 
partitions  in  B  having  columns  with  nonzeroes  above  the  diagonal  are  the  £*’ s  which  are  called 
external  bumps.  The  columns  extending  above  the  diagonal  are  called  spikes  or  spike  columns. 
An  external  bump  is  characterized  as  follows: 

(i)  the  last  column  of  an  external  bump  will  be  a  spike  with  a  nonzero  lying  in  the  top¬ 
most  row  of  the  external  bump,  and 

(ii)  the  nonspike  columns  have  nonzero  diagonal  elements. 

The  algorithms  of  Hellerman  and  Rarick  (5,6]  produce  such  a  structure  for  any  nonsingular 
matrix,  and  we  shall  call  a  matrix  having  this  structure  an  HR  matrix.  It  should  be  noted  that  if 
one  applies  ALG  1  to  an  HR  matrix,  then  the  only  columns  which  may  require  an  interchange 
are  spike  columns.  We  now  prove  that  the  only  columns  which  need  be  considered  for  this  inter¬ 
change  are  other  spikes  in  the  same  external  bump. 

Consider  the  following  result: 

PROPOSITION  7:  Let  BU)  with  i  >  2  correspond  to  the  first  column  of  some  external 
bump,  Fk ,  and  let  B(j)  be  a  spike  in  Fk.  Then  £'"'  . . .  E'Bij)  =  B(J). 

PROOF:  Note  that  the  first  i  —  1  components  of  Bij )  are  zero.  Therefore,  by  successive 
application  of  Proposition  2,  the  result  is  proved. 

Note  that  Proposition  6  allows  one  to  eliminate  all  of  the  calculation  required  in  Step  1  of 
ALG  1  for  nonspike  columns  and  Proposition  7  allows  one  to  eliminate  some  of  this  calculation 
for  spikes.  We  now  address  the  issue  of  spike  swapping.  Consider  the  following  propositions. 

PROPOSITION  8:  Any  spike  B(j)  which  is  not  pivotable  cannot  be  interchanged  with  a 
spike  Bik),  k  >  j,  from  another  external  bump,  to  yield  a  pivotable  column. 

PROOF:  Since  Bik)  is  from  an  external  bump  lying  to  the  right  of  the  external  bump 
containing  Bij),  Bfk)  =  0.  By  repeated  application  of  Proposition  2,  F1  1  ..  F 
Bik)  =  Bik)  Thus  B(j)  cannot  be  interchanged  with  B(k )  to  yield  a  pivotable  column. 

PROPOSITION  9:  Any  spike  Bij)  which  is  not  pivotable  cannot  be  interchanged  with  a 
nonspike  column  Bik),  k  >  j,  to  yield  a  pivotable  column. 

PROOF:  Let  ^(A:),  with  A  >  j  correspond  to  any  nonspike  column.  From  Proposition  6. 
£'“'  ...  E]B(k)~  B(k).  Since  the  y'th  component  of  Bik)  is  zero,  B(j)  cannot  be  inter¬ 
changed  with  Bik),  to  yield  a  pivotable  column.  We  now  present  the  main  result  of  this  note 

PROPOSITION  10:  Any  spike  column  Bij),  which  is  not  pivotable  can  be  interchanged 
with  a  spike,  Bik),  with  k  >  j  within  the  same  external  bump,  to  yield  a  pivotable  column 

PROOF:  If  Bij)  is  not  pivotable,  then  by  Proposition  5  there  exists  a  column  Bik  t  with 
k  >  j  which  is  pivotable  By  Proposition  8,  Bik)  cannot  be  a  spike  from  a  different  external 
bump  By  Proposition  9,  Bik )  cannot  be  a  nonspike  Hence  Bik )  must  be  a  spike  from  the 
same  external  bump 
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In  practice,  the  zero  check  in  step  2  is  replaced  by  a  tolerance  check.  Discussions  of  prac¬ 
tical  tolerance  checks  may  be  found  in  Benichou  (1 1 ,  Clasen  [2],  Orchard-Hays  (7),  Saunders 
18),  Tomlin  (9),  and  Wolfe  [10], 
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ABSTRACT 

—  ^  p  t>'>A(,)  <  - 

Let  Hit)  -  £  - — -  Hlk),  0  </<•».  where  A  U)h  is  nonde- 

_*-0  *'  _ 
creasing  in  r,  |/MA)i/a)  is  nonincreasing.  It  is  known  that  HU)  —  1  -  // ( r )  is 
an  increasing  failure  rate  on  the  average  (IFRA)  distribution.  A  proof  based 
on  the  IFRA  closure  theorem  is  given  Hit)  is  the  distribution  of  life  for  sys¬ 
tems  undergoing  shocks  occurring  according  to  a  Poisson  process  where  Plk)  is 
the  probability  that  the  system  survives  A  shocks.  The  proof  given  herein 
shows  there  is  an  underlying  connection  between  such  models  and  monotone 
systems  of  independent  components  that  explains  the  IFRA  life  distribution  oc¬ 
curring  in  both  models. 


1.  INTRODUCTION 

In  Barlow  and  Proschan  (1,  p.  931  a  fairly  general  damage  model  is  considered.  A  device 
is  subject  to  shocks  occurring  in  time  according  to  a  Poisson  process  with  raje  A  The  damage 
caused  by  shocks  is  characterized  by  a  sequence  of  numbers  (PUc)),  where  Pik)  is  the  proba¬ 
bility  that  the  device  will  survive  k  shocks.  The  Pik)'s  as  shown  in  { 1  ]  can  arise  in  different 
models.  For  example,  the  damage  caused  by  the  rth  shock  can  be  assumed  to  be  a  nonnegative 
random  variable  X,,  where  T|,  X2  •  •  ■  are  independent  and  identically  distributed,  failure  of  the 

k 

device  occurs  at  the  kth  shock  if  £  T,,  the  cumulative  damage,  exceeds  a  certain  thres- 

i- 1 

hold.  In  this  case  Pik)  =  Pr  X,  ^  yj,  where  y  is  the  threshold  Ross  (2)  has  failure  occur¬ 
ring  when  some  nondecreasing  symmetric  function  D(X\ . Xn)  first  exceeds  a  given  thres¬ 
hold;  i.e.,  DiX | . Xk)  is  a  generalization  of  £  Xr  Here,  Pik  )  —  Pr  (/3(T| . \\  )  ^  y). 

i- 1 

Let  HU)  denote  the  probability  that  the  device  survives  in  the  interval  |0,  tl  Then 

_  t  \  t)k  - 

HU)  -  y  - — Pik),  0  s?  t  < 

*  -  o  k ! 

In  Barlow  and  Proschan  (l!  (Theorem  3.6  p.  93)  it  is  proven  that  if  !P(A )'  *)  is  a  nomncreasing 
sequence  then  Hit)  -  1  -  Hit)  is  always  an  increasing  failure  rate  on  the  average  (IFRA) 

•Work  supported  in  part  by  the  Office  of  Naval  Research  under  Contract  N0OI4-75-O62O  and  the  National  Science 
Foundation  under  Cirant  No  MCS-7725-146  with  Columbia  University 
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distribution  function;  i.e.,  - - -  is  nondecreasing  in  t. 

Ross  12],  generalizes  by  allowing  the  Poisson  process  of  successive  shocks  to  be  nonho- 
mogeneous  with  rate  function  kit)  such  that 

Ait)  _  Jnx(i)</i 

i  t 

is  nondecreasing  in  t.  That  is,  the  same  assertion  can  be  made  when  Hit)  is  given  by 

—  °°  p  au) A  (i)k  - 

(1)  //(z)  =-  ' ~Pik),  0  t  <  co. 

k-0  "  • 


The  proof  given  in  [1]  is  based  on  total  positivity,  arguments.  Ross’s  technique  for  prov¬ 
ing  the  IFRA  result  is  obtained  by  making  use  of  recent  results  [3]  pertaining  to  what  he  calls 
increasing  failure  rate  average  stochastic  processes. 

Our  proof  below  shows  that  all  such  results  are  a  consequence  of  one  of  the  central 
theorems  of  reliability  theory,  the  IFRA  Closure  Theorem  (111  p.  83).  This  theorem  asserts 
that  a  monotone  system  composed  of  a  finite  number  of  independent  components,  each  of 
which  has  an  IFRA  life  distribution,  has  itself  an  IFRA  distribution. 

It  is  remarked  in  [1,  p.  91]  that  the  coherent  (or  monotone)  system  model  and  the  shock 
models  under  consideration  are  widely  diverse  models  for  which  the  IFRA  class  of  distribution 
furnishes  an  appropriate  description  of  life  length,  thus  reenforcing  the  importance  of  the  IFRA 
class  to  reliability  theory.  The  implication  of  our  proof  is  that  the  models  are  not  as  widely 
diverse  as  supposed. 

The  idea  of  the  proof  is  the  construction  of  a  monotone  system  (of  independent  com¬ 
ponents,  each  of  which  has  the  same  IFRA  life  distribution)  whose  life  distribution  approxi¬ 
mates  Hit).  The  proof  is  completed  by  allowing  the  number  of  components  in  the  system  to 
increase  in  an  appropriate  way  so  that  the  approximating  life  distributions  converge  to  Hit), 
the  IFRA  property  being  preserved  in  the  limit 

2.  APPROXIMATING  SYSTEMS  APPROACH 

For  each  m,  m  “  1 , 2  . . .  let  Sm  „,  n  —  1 , 2,  ...  be  a  monotone  system  of  n  independent 
components  Let 

( 1 )  Pmnik)  =  Pr  (no  cut  set  is  formed  |  exactly  k  components  of  S„  „  are  failed] 

where  all  of  the  n  components  are  equally  likely  to  fail.  (A  cut  set  is  a  set  of  components  such 
that  if  all  components  of  the  set  fail,  the  system  does  not  function).  Assume 

(2)  Pm  nik )  —  0,  if  k  >  m  for  every  n, 

(3)  lim  Pm„  ik)  -  Pmik),  for  every  k 

(4)  lim  P„ik)  —  Pik),  for  every  k. 


We  can  state 
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HU) 


THEOREM  1:  If  AU)  >  0,  — 

_  "  I 

-  1  -  HU)  given  by  (1)  is  IFRA. 


is  nondecreasing  and  (2),  (3)  and  (4)  hold,  then 


PROOF:  Assume  every  component  in  Sm„  is  independent  with  life  distribution 
LU)  -  1  -  e~A{,)/n.  Then  every  component  has  an  IFRA  distribution.  Let  Qm.„(k,t)  denote 
the  probability  that  exactly  k  units  fail  within  (0,  /].  That  is 


(5) 


Qm.n  (*.') 


-/*(<>!*  f  -Ait) 

l-e  "  J  U  " 


Let  Hmn(t)  denote  the  probability  that  Sm  „  works  for  at  least  t  units  of  time,  then 


(6)  Hm,„U)-  f  Qmn(k,t)Pm.M). 

k- 0 


By  the  IFRA  Closure  Theorem,  Hm„U)  is  IFRA. 


However, 

(7) 


HmU) 


lim  Hm„U) 


m 


I 

*-0 


lim  Qm„ik.t)  P„(k) 


y  e-A(,)A(r)k 

2L  jl  i 

A-0  K 

T  e~AU)A(t)k 

L I 

*-0  K  ■ 


Pmlk) 

PJk) 


by  (2),  the  Poisson  limit  of  binomial  probabilities,  and  (3).  Since  the  IFRAproperty  is 
preserved  in  the  limit,  H„U)  is  IFRA.  That  is,  since  HmU)  -  lim  Hm„it)  and 

_  _  /I— oo 

-(log  Hm  „U))/t  is  nondecreasing  in  r,  then  so  is  -(log  Hm(t))/t.  However, 
lim  H„(t)  -  £  — AU)k  lim  Pmik) 

m—**>  ^ k  !  m  —  oo 


“  Ze-4U'  ^jr-Plk) 

k- n  *  ■ 

-  HU). 


Since  again  the  IFRA  property  is  preserved  in  the  limit,  it  follows  that  Hit)  is  IFRA,  proving 
the  theorem. 


We  emphasize  that  the  IFRA  Closure  Theorem  is  invoked  only  to  show  that  that  Hm„it) 
is  IFRA  The  condition  that  Air)/t  is  nondecreasing  is  needed  so  that  all  components  of  Sm„ 
have  an  IFRA  distribution. 

3.  APPLICATION  OF  THEOREM 

The  condition  that  {PUc)1'*)  is  a  nonincreasing  sequence  is  not  used  in  the  proof  nor  does 
it  appear  in  the  statement  of  Theorem  I.  That  the  condition  is  implicit  is  due  to  a  recent 
remarkable  result  of  Ross,  Shashahani  and  Weiss  [4)  that  (P(k),/kj  is  necessarily  nonincreasing. 
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To  apply  Theorem  1  for  our  purpose  we  must  show 


THEOREM  2:  Let  { /*< Ar )}  be  any  sequence  such  that  0  ^  P(k)  4,  1  and  (/>(k)l/M  is 
nonincreasing.  Then  there  exist  the  monotone  systems  |Sm,„)  such  that  (2),  (3),  and  (4)  hold. 


PROOF:  Let  |P(A))  be  any  sequence  with  the  hypothesized  properties.  Let  F  be  any 
increasing  continuous  distribution  function  over  [0,  °°)  and  (y*)  the  nonincreasing  sequence  of 
nonnegative  numbers  such  that 

F(yk)  -  P(k)Uk.  k  -  1,2,  ... 

For  each  m(m  <  n )  let  Sm„  be  a  set  of  n  components,  i  —  1,  ....  n.  The  cut  sets  are  con¬ 
structed  in  the  following  way.  The  tth  component  has  an  associated  value  x,,  i  —  1,  ....  n 
where  the  values  are  assigned  so  that 

#{/|x,  <  x)  -  l«  F(x)],  0  <  x  <  yt. 


“  «.  x  >  y,. 

where  #  means  "number  of"  and  [  1  is  the  greatest  integer  designator.  Every  set  of  k  com¬ 
ponents  is  a  cut  set  if  k  >  nr,  if  k  <  m  a  set  (i( . ik)  of  components  is  a  cut  set  if  and 

only  if 


max  (x,( . x,()  >  yk. 


Since  ly*l  is  nonincreasing,  „  is,  indeed,  a  monotone  set.  But  here. 


PmJk) 


krl  [nF(yk)\  -  i 

n — — - —  k  < « 


-  0 


k  >  m. 


Thus, 


Pm(k)  =  lim  P„.„(k ) 

n  —  oo 

|  Fk(yk),  if  k  ^  m 
~  \  0  .if  k  >  m 


and 


lim  PJk)  -  Fk(yk) 

V  —  oo 

-  P(k)  .  k  -  1.2 . 


This  proves  Theorem  2. 


Theorems  one  and  two  yield  the  slightly  more  general  version  of  Theorem  3.6  |1,  p  93] 

The  Ross  (2]  generalization  follows  by  defining  the  cut  sets  to  be  determined  by  a  nonde¬ 
creasing  symmetric  function  D(x,,  ....  x*),  i.e  ,  a  set  /, . in  of  components  is  a  cut  set  of 

Sm„  if  k  >  m  or,  if  k  ^  m,  when  £>(x,( . x*t)  >  y,  a  given  threshold  value.  From  the 

construction  of  Theorem  2,  Theorem  1  and  the  result  referred  to  in  [4]  it  follows  that  the 

sequence  (PUc)]  of  this  model  satisfies  the  monotonicity  condition.  For  the  special  case  of 
*  _ 

D(X | . Xk )  -  J7.  X,,  it  is  known  that  the  sequence  {/>(<c)l/‘|  is  nonincreasing  (see  { 1  j  p. 

96). 
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Defense  Systems  Management  College 
Military  Reservist  Utilization  Program 


Military  reservists  from  ali  U.S.  Services  now  have  a  unique  opportunity  for  a  short  tour 
at  the  Defense  Systems  Management  College,  Ft.  Belvoir  Virginia.  By  volunteering  for  the 
Reservist  Utilization  Program,  an  individual  can  increase  proficiency  training,  maintain  currency 
in  DOD  Research,  Development  &  Acquisition  Policy,  contribute  to  the  development  and  for¬ 
mulation  of  concepts  that  may  become  the  bases  of  future  DOD  policy  and  help  solve  critical 
problems  facing  the  acquisition  community. 

Once  accepted  for  the  program,  a  reservist  may  be  assigned  to  one  of  three  areas: 
research,  education  or  operations.  As  a  research  associate,  the  individual  researches  and 
analyzes  an  area  compatible  with  his  training  and  experience.  Many  reservists  in  this  category 
currently  assist  in  the  preparation  of  material  for  a  comprehensive  textbook  on  systems  acqusi- 
tion  The  text  will  be  used  at  DSMC  by  the  faculty  and  students  as  well  as  by  the  systems 
acquisition  community  As  an  academic  consultant,  a  reservist  provides  special  assistance  to 
the  College  faculty  by  reviewing  course  material  in  his  area  of  expertise  and  researching  and 
developing  training  materials  In  the  operations/administration  category,  reservists  administer 
the  program  by  recruiting  other  reservists  for  the  program,  processing  these  reservists,  and 
maintaining  files  and  records. 

Because  of  the  complexity  and  broad  scope  of  the  systems  acquisition  business,  the  Reser¬ 
vist  Utilization  Program  requires  a  large  number  of  reservists  from  many  diverse  career  fields 
.Some  examples  of  career  fields  used  include:  engineering,  procurement,  manufacturing,  legal, 
financial,  personnel,  administration  and  logistics  Reservists  whose  reserve  duty  assignments 
are  not  in  these  types  of  career  fields,  but  who  have  civilian  experience  in  these  areas,  are  also 
urged  to  apply. 

Many  reservists  perform  their  annual  tours  with  the  Reservist  Utilization  Program  office 
Others  perform  special  tours  of  active  duty  or  "mandays  "  When  tour  dales  are  determined  and 
coordinated  with  your  organization  and  the  RUP  office,  submit  the  proper  forms  through  your 
reserve  organization  at  least  45  days  prior  to  the  tour  date  for  an  annual  tour  or  60  days  lor  a 
special  tour 

To  apply  for  active  duty  or  to  get  additional  information,  telephone  Professor  Fred  I- 
Rosell.  Jr.  at  commerical  (703)  664-5783  or  AUTOVON  354-5783  Reservists  outside  of  Vir¬ 
ginia  may  call  on  toll-free  number  (800)  336-3095  ext.  5783 
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CORRIGENDUM: 

STOCHASTIC  CONTROL  OF  QUEUEING  SYSTEMS 

Dr.  A  Laurinavicius  of  the  Institute  of  Physical  and  Technical  Problems  of  Energetics. 
Academy  of  Sciences,  Lithuania,  USSR,  has  pointed  out  an  error  in  the  statement  of  Theorem 
1  of  this  paper  (1|.  The  expression  for  the  generator  given  there  is  valid  only  lor  x  >  0.  and  a 
different  expression  holds  for  x  =  0,  the  proof  for  this  case  being  similar  Moreover,  the 
domain  of  the  generator  can  be  extended  The  correct  statement  is  as  follows 


THEOREM  1:  Let  the  function  J'(i.x)  be  continuous  and  such  that  the  directional  deriva- 

Dr,  ru.x)  =  lim  f(l  +  h-x  -h)~  jUx)-  (X  >  01 

h  *0  +  h 

Dv  /(/,  0)  =  lim  +  ,u0)  ~  J--0)  =  ~  /(/,  0) 

1  '  /i— o+  h  at 

where  P  =  (1,  -  1)  and  Q  =  (1,0),  exist,  be  continuous  from  one  side  and  bounded  Then 
the  infinitesimal  generator  of  the  semigroup  ( 7~, }  is  given  by 

Af'(t.x)  =  DpKt.x)  -  X  /  (l,x)  +  X  f  /(r..v  +  v)  fl  tr/v)  for  x  >  0 

=  Dq  fit. 0)  -  kfit. 0)  +  X  J  fU.v)B(av)  for  x  =  0. 


tives 

(1) 

(2) 


(3) 


As  a  consequence  of  this  error  the  example  of  Section  3  does  not  lead  to  the  stated  result. 
A  correct  example  is  provided  by  the  following.  Let  r(r),  the  revenue  per  unit  time,  and  c(t). 
the  operating  cost  per  unit  time,  be  given  by 

r(t)  =  r  for  0  ^  /  <  r(),  and  =  0  for  t  >  t0 

c(t)  =  C|  for  0  <  t  <  r0,  and  =  c2  for  i  >  t0. 

The  profit  from  operating  the  system  up  to  a  time  Tis  given  by  f(T,  Wr)>  where 

(4)  fU.x)  =  r  minU,t0)  _  C|f0  -  c2  max  (O.f  +  x  —  t n). 

This  leads  to  the  following  correct  version  of  Theorem  3. 

THEOREM  3:  Let  TF,>  =  k •  <  t0  and  assume  that 

(5)  Xct  f  11  -  S(v))dv  <  r  <  Xc2/3 

where  /3  is  the  mean  service  time.  Then  the  optimal  time  is  given  by 

(6)  T„  =  inf(r  >  0:  t  +  W,  3*  a) 
where  a  is  the  unique  solution  of  the  equation 

(7)  Xc2  f  (1  -  8(v)]dv  -  r. 

J'a  " 
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CORRKitNDUM 


PROOF:  It  is  found  that  for  x  >  0 

f  [/(r,x  +  v)  -  f(t,x)\B(dx)  -  -c2  f  .11-  fl(v)]dv 
*'0-  + 

where  (r0  -  t  -  x)  +  =  max  (0,rfl  -  t  -  x).  Also, 

Dp  f(l,x )  =  r  for  t  <  t0,  and  —  0  for  /  >  iu  (x  >  0) 


D@  /(r.0)  =  r  for  t  <  i0,  and  —  — c2  for  t  >  l0. 

Therefore,  the  generator  in  this  case  is  given  by 

Af(t,x)  =  r  —  kc2  f  [1  -  B(v)]dv  for  t  <  r0,  x  ^  0 

^  (lg-1-JC)* 

=  — A2/3  for  />/(>,  x  >  0 

^  =  —  c2  —  \C2/3  for  f  >  t0,  x  =  0. 

In  applying  Theorem  2  we  note  that  Af(t,x)  <  0  for  t  ^  /0,  x  >  0,  so  it  suffices  to  consider 
T/(/,x)  for  t  <  t0,  x  >  0.  We  can  write 

-4/0, x)  •=  0(/  +  x)  for  /  <  r0,  x  >  0, 


<Mf)  =  r  —  Xc2  [  [1  —  i?(v)]</v. 

<M0)  ”  r  —  Ac,  f  (1  —  B(v)]rfv  >  r  -  kc2  f  [1  —  fl(v)ldv  >  0 
<t>(t0)  “  r  —  Xcj/3  <  0 

on  account  of  (5).  Also,  <f>(t)  is  a  decreasing  function  of  t.  Therefore,  there  exists  a  unique 
value  a  such  that  <f>(t)  >  0  for  0  <  t  <  a  and  </>(/)<  0  for  a  <  t  ^  r0.  Since  <Mf)  <  0  for 
i  >  /q,  we  have  4>(t)  <  0  for  /  >  a.  This  means  that  ,4/(/,x)  ^  0  for  r  +  x  >  a,  so  the  set 
/?  of  Theorem  2  is  given  by  R  —  (0.x):  r  +  x  ^  a},  and  the  time  of  the  first  visit  to  R  is 
given  by  (6).  Since  the  process  t  +  IV,  is  monotone  nondecreasing  with  probability  one,  the 
set  R  is  closed.  Moreover,  Ta  <  a  with  probability  one  and  also  E!  T0)  <  Thus,  the  condi¬ 
tions  of  Theorem  2  are  satisfied,  and  Ta  is  optimal  at  IV0  —  w,  as  was  required  to  be  proved. 


where 

(9) 

We  have 


A  particular  case.  Let  B(x)  —  1  —  e  (x  ^  0,  0  <  n  <  «>).  The  conditions  (5)  reduce 


to 

(10) 

and  the  Equation  (7)  gives 


w  <  1 0 - log 

P 


(11) 


1  . 

a  -  f„ - log 

P 


^c2 

pr  \ 

pr 


<  h 


On  account  of  (11)  we  have  a  >  tv. 
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